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EXPRESSION VECTORS FOR STIMULATING AN IMMUNE 
RESPONSE AND METHODS OF USING THE SAME 

CROSS-REFERENCES TO RELATED APPLICATIONS 
5 This application claims the benefit of 09/078,904, filed May 13, 1998, and 

60/085,751, filed May 15, 1998, both herein incorporated by reference in their entirety. 

STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER 
FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT 

10 This invention was made with government support under NIH Grant No. 

AI-42699-01, NIH Grant No. AI38584.03, and NIH Contract No. NOl-AI-45241. The 

Government has certain rights in this invention. 

FIELD OF THE INVENTION 
15 The present invention relates to nucleic acid vaccines encoding multiple 

CTL and HTL epitopes and MHC targeting sequences. 

BACKGROUND OF THE INVENTION 
Vaccines are of fundamental importance in modem medicine and have 
20 been highly effective in combating certain human diseases. However, despite the 

successful implementation of vaccination programs that have greatly limited or virtually 
eliminated several debilitating human diseases, there are a number of diseases that affect 
millions worldwide for which effective vaccines have not been developed. 

Major advances in the field of immunology have led to a greater 
25 understanding of the mechanisms involved in the immune response and have provided 
insights into developing new vaccine strategies (Kuby, Immunology, 443-457 (3rd ed., 
1997), which is incorporated herein by reference). These new vaccine strategies have 
taken advantage of knowledge gained regarding the mechanisms by which foreign 
material, termed antigen, is recognized by the immune system and eliminated firom the 
30 organism. An effective vaccine is one that elicits an immune response to an antigen of 
interest. 

Specialized cells of the immune system are responsible for the protective 
activity required to combat diseases. An immune response involves two major groups of 
cells, lymphocytes, or white blood cells, and antigen-presenting cells. The purpose of 
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these immune response cells is to recognize foreign material, such as an infectious 
organism or a cancer cell, and remove that foreign material from the organism. 

Two major types of lymphocytes mediate different aspects of the immune 
response. B cells display on their cell surface specialized proteins, called antibodies, that 

5 bind specifically to foreign material, called antigens. Effector B cells produce soluble 
forms of the antibody/ which circulate throughout the body and function to eliminate 
antigen from the organism. This branch of the immune system is known as the humoral 
branch. Memory B cells function to recognize the antigen in future encounters by 
continuing to express the membrane-bound form of the antibody. 

10- A second major type of lymphocyte is the T cell. T cells also have on their 

cell surface specialized proteins that recognize antigen but, in contrast to B cells, require 
that the antigen be bound to a specialized membrane protein complex, the major 
histocompatibility complex (MHC), on the surface of an antigen-presenting cell. Two 
major classes of T cells, termed helper T lymphocytes ("HTL") and cytotoxic T 

15 lymphocytes ("CTL"), are often distinguished based on the presence of either CD4 or 
CDS protein, respectively, on the cell surface. This branch of the immune system is 
known as the cell-mediated branch. 

The second major class of immune response cells are cells that function in 
antigen presentation by processing antigen for binding to MHC molecules expressed in 

20 the antigen presenting cells. The processed antigen bound to MHC molecules is 

transferred to the surface of the cell, where the antigen-MHC complex is available to bind 
to T cells. 

MHC molecules can be divided into MHC class I and class II molecules 
and are recognized by the two classes of T cells. Nearly all cells express MHC class I 

25 molecules, which function to present antigen to cytotoxic T lymphocytes. Cytotoxic T 
lymphocytes typically recognize antigen bound to MHC class I. A subset of cells called 
antigen-presenting cells express MHC class II molecules. Helper T lymphocytes 
typically recognize antigen bound to MHC class II molecules. Antigen-presenting cells 
include dendritic cells, macrophages, B cells, fibroblasts, glial cells, pancreatic beta cells, 

30 thymic epithelial cells, thyroid epithelial cells and vascular endothelial cells. These 

antigen-presenting cells generally express botli MHC class I and class II molecules. Also, 
B cells function as both antibody-producing and antigen-presenting cells. 

Once a helper T lymphocyte recognizes an antigen-MHC class II complex 
on the surface of an antigen-presenting cell, the helper T lymphocyte becomes activated 

- 2 - 
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and produces growth factors that activate a variety of cells involved in the immune 
response, including B cells and cytotoxic T lymphocytes. For example, under the 
influence of growth factors expressed by activated helper T lymphocytes, a cytotoxic T 
lymphocyte that recognizes an antigen-MHC class I complex becomes activated. CTLs 
5 monitor and eliminate cells that display antigen specifically recognized by the CTL, such 
as infected cells or tumor cells. Thus, activation of helper T lymphocytes stimulates the 
activation of both the humoral and cell-mediated branches of the immune system. 

An important aspect of the immune response, in particular as it relates to 
vaccine efficacy, is the manner in which antigen is processed so that it can be recognized 
10 by the specialized cells of the immune system. Distinct antigen processing and 

presentation pathways are utilized. The one is a cytosolic pathway, which results in the 
antigen being bound to MHC class I molecules. An alternative pathway is an 
endoplasmic reticulum pahtway, which bypasses the cytosol. Another is an endocytic 
pathway, which results in the antigen being bound to MHC class II molecules. Thus, the 
15 cell surface presentation of a particular antigen by a MHC class II or class I molecule to a 
helper T lymphocyte or a cytotoxic T lymphocyte, respectively, is dependent on the 
processing pathway for that antigen. 

The cytosolic pathway processes endogenous antigens that are expressed 
inside the cell. The antigen is degraded by a speciahzed protease complex in the cytosol 
• 20 of the cell, and the resulting antigen peptides are transported into the endoplasmic 
reticulum, an organelle that processes cell surface molecules. In the endoplasmic 
reticulum, the antigen peptides bind to MHC class I molecules, which are then 
transported to the cell surface for presentation to cytotoxic T lymphocytes of the immune 
system. 

25 Antigens that exist outside the cell are processed by the endocytic 

pathway. Such antigens are taken into the cell by endocytosis, which brings the antigens 
into specialized vesicles called endosomes and subsequently to specialized vesicles called 
lysosomes, where the antigen is degraded by proteases into antigen peptides that bind to 
MHC class II molecules. The antigen peptide-MHC class II molecule complex is then 

30 transported to the cell surface for presentation to helper T lymphocytes of the inrunune 
system. 

A variety of factors must be considered in the development of an effective 
vaccine. For example, the extent of activation of either the humoral or cell-mediated 
branch of the immune system can determine the effectiveness of a vaccine against a 
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panicular disease. Furthermore, the development of immunologic memory by inducing 
memory-cell formation can be important for an effective vaccine against a particular 
disease (Kuby, supra). For example, protection from infectious diseases caused by 
pathogens with short incubation periods, such as influenza virus, requires high levels of 

5 neutralizing antibody generated by the humoral branch because disease symptoms are 
already underway before memory cells are activated. Alternatively, protection from 
infectious diseases caused by pathogens with long incubation periods, such as poho virus, 
does not require neutrializing antibodies at the tinie of infection but instead requires 
memory B cells that can generate neutrali2dng antibodies to combat the pathogen before it 

10 is able to infect target tissues. Therefore, the effectiveness of a vaccine at preventing or 
ameliorating the symptoms of a particular disease depends on the type of immune 
response generated by the vaccine. 

Man}- traditional vaccines have relied on intact pathogens such as 
attenuated or inactivated viruses or bacteria to elicit an immune response. However, 

1 5 these traditional vaccines have advantages and disadvantages, including reversion of an 
attenuated pathogen to a virulent form. The problem of reversion of an attenuated 
vaccine has been addressed by the use of molecules of the pathogen rather than the whole 
pathogen. For example, immunization approaches have begun to incorporate 
recombinant vector \ accines and synthetic peptide vaccines (Kuby, supra). Recently, 

20 DNA vaccines have also been used (Donnelly et ai, Annu, Rev. Immunol. 15:617-648 
(1997), which is incorporated herein by reference). The use of molecules of a pathogen 
provides safe vaccines that circumvent the potential for reversion to a virulent form of the 
vaccine. 

The targeting of antigens to MHC class II molecules to activate helper T 
25 lymphocytes has been described using lysosomal targeting sequences, which direct 

antigens to lysosomes, where the antigen is digested by lysosomal proteases into antigen 
peptides that bind to MHC class 11 molecules (U.S. Patent No, 5,633,234; Thomson et aL 
I Virol 72:2246-2252 (1998)). It would be advantageous to develop vaccines that 
deliver multiple antigens while exploiting the safety provided by administering individual 
30 epitopes of a pathogen rather than a whole organism. In particular, it would be 
advantageous to develop vaccines that effectively target antigens to MHC class II 
molecules for activation of helper T lymphocytes. 

Several studies also point to the crucial role of cytotoxic T cells in both 
production and eradication of infectious diseases and cancer by the immune system 
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(Byrne eM/., 1 Immunol 51:682 (1984); McMichael ei al, N. Engl. J. Med. 309:13 
(1983)). Recombinant protein vaccines do not reliably induce CTL responses, and the 
use of otherwise immunogenic vaccines consisting of attenuated pathogens in humans is 
hampered, in the case of several important diseases, by overriding safety concerns. In the 
5 case of diseases such as iUV, HBV, HCV, and malaria, it appears desirable not only to 
induce a vigorous CTL response, but also to focus the response against highly conserved 
epitopes in order to prevent escape by mutation and overcome variable vaccine efficacy 
against different isolates of the target pathogen. 

Induction of a broad response directed simultaneously against multiple 
10 epitopes also appears to be crucial for development of efficacious vaccines. HIV 
infection is perhaps the best example where an infected host may benefit from a 
multispecific response. Rapid progression of HIV infection has been reported in cases 
where a narrowly focused CTL response is induced whereas nonprogressors tend to show 
a broader specificity of CTLs (Goulder ei al, Nat. Med 3:212 (1997); Borrow et al, Nat. 
15 Med, 3:205 (1997)). The highly variable nature of HIV CTL epitopes resulting from a 
highly mutating genome and selection by CTL responses directed against only a single or 
few epitopes also supports the need for broad epitope CTL responses (McMichael et al. . 
Annu, Rev, Immunol 15:271 (1997)). 

One potential approach to induce multispecific responses against 
20 conserved epitopes is immunization with a minigene plasmid encoding the epitopes in a 
string-of-beads fashion. Induction of CTL, HTL, and B cell responses in mice by 
minigene plasmids have been described by several laboratories using constructs encoding 
as many as 11 epitopes {Anetal, J. Virol 71:2292 (1997); Thomson et aL / Immunol 
157:822 (1996); Whinon et al, 1 Virol 67:348 (1993); Hanke et al. Vaccine 16:426 
25 (1998); Vitiello et aL Eur. 1 Immunol 27:671-678 (1997)). Minigenes have been 

delivered in vivo by infection with recombinant adenovirus or vaccinia, or by injection of 
purified DNA via the intramuscular or intradermal route (Thomson et al, J, Immunol 
160:1717 (1998); Toes et aL Proa, Natl Acad, Scl USA 94:14660 (1997)). 

Successful development of minigene DNA vaccines for human use will 
30 require addressing certain fundamental questions dealing with epitope MHC affinity, 
optimization of constructs for maximum in vivo immunogenicity, and development of 
assays for testing in vivo potency of multi-epitope minigene constructs. Regarding MHC 
binding affinity of epitopes, it is not currently known whether both high and low affinity 
epitopes can be included within a single minigene construct, and what ranges of peptide 
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affinity are permissible for CTL induction in vivo. This is especially important because 
dominant epitopes can vary in their affinity and because it might be important to be able 
to deliver mixtures of dominant and subdominant epitopes that are characterized by high 
and low MHC binding affinities. 

5 With respect to minigene construct optimization for maximum 

immunogenicity in vivo, conflicting data exists regarding whether the exact position of 
the epitopes in a given construct or the presence of flanking regions, helper T cell 
epitopes, and signal sequences might be crucial for CTL induction (Del Val et al, Cell 
66: 1 145 (1991); Bergmann et ai, J. Virol 68:5306 (1994); Thomson et al, Proc. Natl 

10 Acad. ScL USA 92:5845 (1995); Shirai et aL J- Infect. Dis. 173:24 (1996); RahemtuUae^ 
ai, Nature 353:180 (1991); Jennings et al. Cell Immunol 133:234 (1991); Anderson et 
al, 1 Exp, Med. 174:489 (1991); Uger et al, 1 Immunol 158:685 (1997)). Finally, 
regarding development of assays that allow testing of human vaccine candidates, it should 
be noted that, to date, all in vivo immunogenicity data of multi-epitope minigene plasmids 

15 have been performed with murine class I MHC-restricted epitopes. It would be 
advantageous to be able to test the in vivo immunogenicity of minigenes containing 
human CTL ephopes in a convenient animal model system. 

Thus, there exists a need to develop methods to effectively deliver a 
variety of HTL (helper T lymphocyte) and CTL (cytotoxic T lymphocyte) antigens to 

20 stimulate an immune response. The present invention satisfies this need and provides 
related advantages as well. 

SUMMARY OF THE INVENTION 
The invention therefore provides expression vectors encoding two or more 

25 HTL epitopes fused to a MHC class II targeting sequence, as well as expression vectors 
encoding a CTL ephope and a universal HTL epitope fused to an MHC class I targeting 
sequence. The HTL epitope can be a universal HTL epitope (also referred to as a 
universal MHC class U epitope). The invention also provides expression vectors 
encoding two or more HTL epitopes fused to a MHC class 11 targeting sequence and 

30 encoding one or more CTL epitopes. The invention additionally provides methods of 

stimulating an immune response by administering an expression vector of the invention in 
vivo, as well as methods of assaying the human immunogenicity of a human T cell 
peptide epitope in vivo in a non-human mammal 



6 



wo 99/58658 



PCT/US99/10646 



In one aspect, the present invention provides an expression vector 
comprising a promoter operably linked to a first nucleotide sequence encoding a major 
histocompatibility (MHC) targeting sequence fused to a second nucleotide sequence 
encoding two or more heterologous peptide epitopes, wherein the heterologous peptide 
5 epitopes comprise two HTL peptide epitopes or a CTL peptide epitope and a universal 
HTL peptide epitope. 

In another aspect, the present invention provides a method of inducing an 
immune response in vivo comprising administering to a manunalian subject an expression 
vector comprising a promoter operably Unked to a first nucleotide sequence encoding a 
10 major histocompatibihty (MHC) targeting sequence fused to a second nucleotide 

sequence encoding two or more heterologous peptide epitopes, wherein the heterologous 
peptide epitopes comprise two HTL peptide epitopes or a CTL peptide epitope and a 
universal HTL peptide epitope. 

In another aspect, the present invention provides a method of inducing an 
15 immime response in vivo comprising administering to a mammahan subject an expression 
vector comprising a promoter operably linked to a first nucleotide sequence encoding a 
major histocompatibility (MHC) targeting sequence fused to a second nucleotide 
sequence encoding a heterologous human HTL peptide epitope. 

In another aspect, the present invention provides a method of assaying the 
20 human immunogenicity of a human T cell peptide epitope in vivo in a non-human 

mammal, comprising the step of admmistering to the non-human mammal an expression 
vector comprising a promoter operably linked to a first nucleotide sequence encoding a 
heterologous human CTL or HTL peptide epitope. 

In one embodiment, the heterologous peptide epitopes comprise two or 
25 more heterologous HTL peptide epitopes. In another embodiment, the heterologous 
peptide epitopes comprise a CTL peptide epitope and a universal HTL peptide epitope. 
In another embodiment, the heterologous peptide epitopes fiarther comprise one to two or 
more heterologous CTL peptide epitopes. In another embodiment, the expression vector 
comprises both HTL and CTL peptide epitopes. 
30 In one embodiment, one of the HTL peptide epitopes is a universal HTL 

epitope. In another embodiment, the universal HTL epitope is a pan DR epitope. In 
another embodiment, the pan DR epitope has the sequence 
AlaLysPheValAlaAlaTrpThrLeuLysAlaAlaAla (SEQ ID NO:38). 
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In one embodiment, the peptide epitopes are hepatitis B virus epitopes, 
hepatitis C virus epitopes, human immunodeficiency virus epitopes, human papilloma 
virus epitopes, MAGE epitopes, PSA epitopes, PSM epitopes, PAP epitopes, p53 
epitopes, CEA epitopes, Her2/neu epitopes, or Plasmodium epitopes. In another 
5 embodiment, the peptide epitopes each have a sequence selected from the group 

consisting of the peptides depicted in Tables 1-8. In another embodiment, at least one of 
the peptide epitopes is an analog of a peptide depicted in Tables 1-8. 

in one embodiment, the MHC targeting sequence comprises a region of a 
polypeptide selected from the group consisting of the li protein, LAMP-I, HLS-DM, 
10 HLA--D0, H2-D0, influenza matrix protein, hepatitis B surface antigen, hepatitis B virus 
core antigen, Ty panicle, Ig-a protein, Ig-p protein, and Ig kappa chain signal sequence. 

In one embodiment, the expression vector ftirther comprises a second 
promoter sequence operably Unked to a third nucleotide sequence encoding one or more 
heterologous HTL or CTL peptide epitopes. In another embodiment, the CTL peptide 
1 5 epitope comprises a structural motif for an HL A supertype, whereby the peptide CTL 
epitope binds to two or more members of the supertype with an affinity of greater that 
500 nM. In another embodiment, the CTL peptide epitopes have structural motifs that 
provide binding affinity for more than one HLA allele supertype. 

In one embodiment, the non-human mammal is a transgenic mouse that 
20 expresses a human HLA allele. In another embodiment, the human HLA allele is selected 
from the group consisting of Al 1 and A2.1. In another embodiinent, the non-human 
mammal is a macaque that expresses a human HLA allele. 

BRIEF DESCRIPTION OF THE DRAWINGS 
25 Figure 1 shows the nucleotide and amino acid sequences (SEQ ID NOS : 1 

and 2, respectively) of the BPADRE construct encoding a fusion of the murine li gene 
with a pan DR epitope sequence substituted for the CLIP sequence of the li protein. 

Figure 2 shows the nucleotide and amino acid sequences (SEQ ID N0S:3 
and 4, respectively) of the I80T construct encoding a fusion of the cytoplasmic domain, 
30 the transmembrane domain and part of the luminal domain of the li protein fused to 
multiple MHC class II epitopes. 

Figure 3 shows the nucleotide and amino acid sequences (SEQ ED NOS: 5 
and .6, respectively) of the liThfrill construct encoding a fiision of the cytoplasmic 
domain, transmembrane domain and a portion of the luminal dommn of the li protein 
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fused to multiple T helper epitopes and amino acid residues 101 to 215 of the li protein, 
which encodes the trimerization region of the li protein. 

Figure 4 shows the nucleotide and amino acid sequences (SEQ ID N0S:7 
and 8, respectively) of the KappaLAMP-Th construct encoding a fusion of the murine 
5 immunoglobulin kappa signal sequence fused to multiple T helper epitopes and the 
transmembrane and cytoplasmic domains of LAMP- 1. 

Figure 5 shows the nucleotide and amino acid sequences (SEQ ID N0S:9 
and 10, respectively) of the H2M-Th construct encoding a fusion of the signal sequence 
of H2rM fused to multiple MHC class II epitopes and the transmembrane and 
10 cytoplasmic domains of H2-M. 

Figure 6 shows the nucleotide and amino acid sequences (SEQ ED N0S:1 1 
and 12, respectively) of the H20-Th construct encoding a fusion of tiie signal sequence of 
H2-D0 fused to multiple MHC class n epitopes and the transmembrane and cytoplasmic 
domains of H2-D0. 

15 Figure 7 shows the nucleotide and amino acid sequences (SEQ ID N0S:13 

and 14, respectively) of the PADRE-Influenza matrix construct encoding a fusion of a 
pan DR epitope sequence fused to the amino-terminus of influenza matrix protein. 

Figure 8 shows the nucleotide and amino acid sequences (SEQ ID N0S:15 
and 16, respectively) of the PADRE-HBV-s construct encoding a fusion of a pan DR 
20 epitope sequence fused to the amino-terminus of hepatitis B virus surface antigen. 

Figure 9 shows the nucleotide and amino acid sequences (SEQ ID N0S:17 
and 1 8, respectively) of the Ig-alphaTh construct encoding a fusion of the signal sequence 
of the Ig-a protein fused to multiple MHC class II epitopes and the transmembrane and 
cytoplasmic domains of the Ig-a protein. 
25 Figure 10 shows the nucleotide and amino acid sequences (SEQ ED 

N0S:19 and 20, respectively) of the Ig-betaTh construct encoding a fusion of the signal 
sequence of the Ig-P protein fused to multiple MHC class II epitopes and the 
transmembrane and cytoplasmic domains of the Ig-p protein. 

Figure 1 1 shows the nucleotide and amino acid sequences (SEQ ID 
30 N0S:21 and 22, respectively) of the SigTh construct encoding a fusion of the signal 
sequence of tiie kappa immunoglobulin fused to multiple MHC class II epitopes. 

Figure 12 shows the nucleotide and amino acid sequences (SEQ ID 
NOS:23 and 24, respectively) of human HLA-DR, the invariant chain (li) protein. 
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Figure 13 shows the nucleotide and amino acid sequences (SEQ ID 
NOS:25 and 26, respectively) of human lysosomal membrane glycoprotein- 1 (LAMP-1). 

Figure 14 shows the nucleotide and amino acid sequences (SEQ E) 
NOS:27 and 28, respectively) of human HLA-DMB. 
5 Figure 1 5 shows the nucleotide and amino acid sequences (SEQ ID 

NOS:29 and 30, respectively) of human HLA-DO beta. 

Figure 16 shows the nucleotide and amino acid sequences (SEQ ID 
N0S:3 1 and 32, respectively) of the human MB-1 Ig-a. 

Figure 17 shows the nucleotide and amino acid sequences (SEQ ID 
10 NOS:33 and 34, respectively) of human Ig-P protein. 

Figure 18 shows a schematic diagram depicting the method of generating 
some of the constructs encoding a MHC class 11 targeting sequence fused to multiple 
MHC class II epitopes. 

. Figure 19 shows the nucleotide sequence of the vector pEP2 (SEQ ID 

15 NO:35). 

Figure 20 shows the nucleotide sequence of the vector pMIN.O (SEQ ID 

NO:36). 

Figure 21 shows the nucleotide sequence of the vector pMIN. l (SEQ ID 

NO:37). 

20 Figure 22. Representative CTL responses in HLA-A2. l/K^-H-2^''' mice 

immunized with pMin.l DNA. Splenocytes from primed animals were cultured in 
triplicate flasks and stimulated twice in vitro with each peptide epitope. Cytotoxicity of 
each culture was assayed in a ^^Cr release assay against Jurkat-A2.1/K^ target cells in the 
presence (filled symbols, soUd lines) or absence (open symbols, dotted lines) of peptide. 

25 Each symbol represents the response of a single culture. 

Figure 23, Presentation of viral epitopes to specific CTLs by Jurkat- 
A2. l/K** tumor cells transfected with DNA minigene. Two constructs were used for 
transfection,.pMin;l and pMin.2-GFP. pMin.2-GFP-transfected targets cells were sorted 
by FACS and the population used in- this experiment contained 60% fluorescent cells. 

30 CTL stunulation was measured by quantitating the amount of IFN-y release (A, B) or by 
lysis of ^^Cr-labeled target cells (C, D, hatched bars). CTLs were stimulated with 
transfected cells (A, C) or with parental Jurkat~A2.1/K^ cells in the presence of l >g/ml 
peptide (B, D). Levels of IFN- y release and cytotoxicity for the different CTL lines in 
the absence of epitope ranged from 72- 1 26 pg/ml and 2-6% respectively. 
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Figure 24. Summary of modified minigene constructs used to address 
variables critical for in vivo immunogenicity. The following modifications were 
incorporated into the prototype pMin.l construct; A, deletion of PADRE HTL epitope; B, 
incorporation of the native HB V Pol 55 1 epitope that contains an alanine in position 9; C, 

5 deletion of the Ig kappa signal sequence; and D, switching position of the HBV Env 335 
and HBV Pol 455 epitopes. 

Figure 25. Examination of variables that may influence pMin. 1 
immunogenicity. In vivo CTL-inducing activity of pMin. 1 is compared to modified 
constructs. For ease of comparison, the CTL response induced by each of the modified 
.10. - DNA minigene constructs (shaded bars) is compared separately in each of the four pmiels 
to the response induced by the prototype pMin.l construct (solid bars). The geometric 
mean response of CTL-positive cultures firom two to five independent experiments are 
shown. Numbers shown with each bar indicate the number of positive cultures/total 
number tested for that particular epitope. The ratio of positive cultures/total tested for the 

15 pMin.l group is shown in panel A and is the same for the remaining Figure panels (see 
Example V, Materials and Methods, in vitro CTL cultures, for the defmition of a positive 
CTL culture). Theradigm responses were obtained by immunizing.animais with tlie 
lipopeptide and stimulating and testing splenocyte cultures with the HBV Core 18-27 
peptide. 

.20 

DEFINITIONS 

An "HTL" peptide epitopeor an "MHC II epitope" is an MHC class H 
restricted epitope, i.e., one that is bound by an MHC class II molecule. 

A "CTL" peptide epitope or an *TVIHC I epitope" is an MHC class I 
25 restricted epitope, i.e., one that is bound by an MHC class I molecule. 

An "MHC targeting sequence" refers to a peptide sequence that targets a 
polypeptide, e.g., comprising a peptide epitope, to a cytosolic pathway (e.g., an MHC 
class I antigen processing pathway), en endoplasmic reticulum pathwasy, or an eiidocytic 
pathway (e.g., an MHC class II antigen processing pathway). 
30 The term "heterologous" when used with reference to portions of a nucleic 

acid indicates that the nucleic acid comprises two or more subsequences that are not 
found in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinantly produced, having two or more sequences from uru-elated genes 
arranged to make a new functional nucleic acid, e.g. , a promoter from one source and a 
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coding region from another source. Similarly, a heterologous protein indicates that the 
protein comprises two or more subsequences that are not found in the same relationship to 
each other in nature, e.g., a fusion polypeptide comprising subsequence from different 
polypeptides, peptide epitopes from the same polypeptide that are not naturally in an 
5 adj acent position, or repeats of a single peptide epitope, 

As used herein, the term "universal MHC class II epitope" or a "universal 
HTL epitope" refers to a MHC class II peptide q)itope that binds to gene products of 
multiple MHC class H alleles. For example, the DR. DP and DQ alleles are human MHC 
n alleles. Generally, a unique set of peptides binds to a particular gene product of a MHC 

10 class II allele. In conn-ast, a universal MHCclass H epitope is able . ,, 

products of multiple MHC class II alleles. A universal MHC class H epitope binds to 2 or 
more MHC class II alleles, generally 3 or more MHC class II alleles, and particularly 5 or 
more MHC class II alleles. Thus, the presence of a universal MHC class ti epitope in an 
expression vector is advantageous in that it functions to increase the number of allelic 
1 5 MHC class II molecules that can bind to the peptide and, consequently, the number of 
Helper T lymphocytes that are activated. 

Universal MHC class II epitopes are well known in the art and include, for 
example, epitopes such as the "pan OR epitopes," also referred to as "PADRE" 
(Alexander aL Immunity 1:751-761 (1994); WO 95/07707, USSN 60/036,713, USSN 
20 60/037,432, PCT.aJS98/01373, 09/009,953, and USSN 60/087,192 each of which is 

incorporated herein by reference). A "pan DR binding peptide" or a "PADRE" peptide of 
the invention is a peptide capable of binding at least about 7 different DR molecules, 
preferably 7 of the 12 most common DR molecules, most preferably 9 of the 12 most 
common DR molecules (DRl, 2w2b, 2w2a, 3, 4w4, 4wl4, 5, 7, 52a, 52b, 52c, and 53), or 
25 alternatively, 50% of a panel of DR molecules representative of greater than or equal to 
75% of the human population, preferably greater than or equal to 80% of the human 
population. Pan DR epitopes can bind to a number of DR alleles and are strongly 
immunogenic for T cells. For example, pan DR epitopes were found to be more effective 
at inducing an immune response than natural MHC class H epitopes (Alexander, supra). 
30 An example of a PADRE epitope is the peptide 

AlaLysPheValAlaAlaTrpThrLeuLysAlaAlaAla (SEQ ID NO:38) (for additional 
examples of PADRE epitopes, see Table 8 of TTC docket No. 018623-006221, filed May 
12, 1999, USSN .herein incorporated by reference in its entirety). 
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With regard to a particular amino acid sequence, an "epitope" is a set of 
amino acid residues which is involved in recognition by a particular immunoglobulin, or 
in the context of T cells, those residues necessary for recognition by T cell receptor 
proteins and/or Major Histocompatibility Complex (MHC) receptors. In an immune 
5 system setting, in vivo or in vitro, an epitope is the collective features of a molecule, such 
as primary, secondary and tertiary peptide structure, and charge, that together form a site 
recognized by an immunoglobulin, T cell receptor or HLA molecule. Throughout this 
disclosure epitope and peptide are often used interchangeably. It is to be appreciated, 
however, that isolated or purified protein or peptide molecules larger than and comprising 
1 0 an epitope of the invention are still within the bounds of the invention. 

As used herein, "high affinity" with respect to HLA class I molecules is 
defined as binding with an IC50 (or Kd) of less than 50 nM. "Intermediate affinity" is 
binding with an IC50 (or Kd) of between about 50 and about 500 nM. "High affinity" 
with respect to binding to HLA class II molecules is defined as binding with an Kd of 
1 5 less than 100 nM. "Intermediate affinity" is binding with a Kd of between about 1 00 and 
about. l.OPO nM^ binding are described in detail, e.g., in PCT 

publications WO 94/20127 and WO 94/03205. Alternatively, binding is expressed 
. relatiYe to .a.reference peptide. As a particular assay ^^^^^^^^^ 

IC50S of the peptides tested may change somewhat. However, the binding relative to the 
20 reference peptide not significantly change. For example, in an assay run under 

conditions such that the IC50 of the reference peptide increases 10-fold, the IC50 values 
of the test peptides will also shift approximately 10-fold. Therefore, to avoid ambiguities, 
the assessment of whether a peptide is a good, intermediate, weak, or negative binder is 
generally based on its IC50, relative to the IC50 of a standard peptide. 
25 Throughout this disclosure, results are expressed in terms of "IC50s." 

IC50 is the concentration of peptide in a binding assay at which 50% inhibition of binding 
of a reference peptide is observed. Given the conditions in which the assays are run (i.e., 
limiting HLA proteins and labeled peptide concentrations), these values approximate KD 
values. It should be noted that IC50 values can change, often dramatically, if the assay 
30 conditions are varied, and depending on the particular reagents used (e.g., HLA 

preparation, etc.). For example, excessive concentrations of HLA molecules will increase 
the apparent measured IC50 of a given ligand. 

The terms "identical" or percent "identity," in the context of two or more 
peptide sequences,, refer to two or more sequences or subsequences that are the same or 
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have a specified percentage of amino acid residues that are the same, when compared and 
aligned for maximum correspondence over a comparison window, as measured using a 
sequence comparison algorithms using default program parameters or by manual 
alignment and visual inspection. 
5 The phrases "isolated" or "biologically pure" refer to material which is 

substantially or essentially free from components which normally accompany the material 
as it is found in its native state. Thus, isolated peptides in accordance with the invention 
preferably do not contain materials normally associated with the.peptides in their in situ 
environment. 

1 0 "Major histocompatibility complex" or "MHC" is a cluster of genes that 

plays a role in control of the cellular interactions responsible for physiologic immune 
responses. In humans, the MHC complex is also known as the HLA complex. For a 
detailed description of the MHC and HLA complexes, see Paul, Fundamental 
Immunology (ixd od. 1993). 

15 "Human leukocyte antigen" or "HLA" is a human class I or class II major 

histocompatibility complex (MHC) protein (see, e.g., Stites, et <s\.. Immunology, (8th ed., 
1994). 

An "HLA supertype or family", as used herein, describes sets of HLA 
molecules grouped on the basis of shared peptide-binding specificities. HLA class I 

20 molecules that share somewhat similar binding affinity for peptides bearing certain amino 
acid motifs are grouped into HLA supertypes. The terms HLA superfamily, HLA 
supertype family, HLA family, and HLA xx-like supertype molecules (where xx denotes 
a particular HLA type), are synonyms. 

The term "motif refers to the pattern of residues in a peptide of defined 

25 length, usually a peptide of from about 8 to about 1 3 amino acids for a class I HLA motif 
and from about 6 to about 25 amino acids for a class II HLA motif, which is recognized 
by a particular HLA molecule. Peptide motifs are typically different for each protein 
encoded by each human HLA allele and differ in the pattern of the primary and secondary 
anchor residues. 

30 A "supermotif is a peptide binding specificity shared by HLA molecules 

encoded by two or more HLA alleles. Thus, a preferably is recognized with high or 
intermediate affinitv- (as defmed herein) by two or more HLA antigens. 

"Cross-reactive binding" indicates that a peptide is bound by more than 
one HLA molecule; a synonym is degenerate binding. 
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The tenii "peptide" is used interchangeably with "oligopeptide" in the 
present specification to designate a series of residues, typically L-amino acids, connected 
one to the other, typically by peptide bonds between the a-amino and carboxyl groups of 
adjacent amino acids. The preferred CTL-inducing oUgopeptides of the invention are 13 
5 residues or less in length and usually consist of between about 8 and about 1 1 residues, 
preferably 9 or 10 residues. The preferred HTL-inducing oligopeptides are less than 
about 50 residues in length and usuaUy consist of betweMi about 6 and about 30 residues, 
more usually between about 12 and 25, and often between about 1 5 and 20 residues. 

An "immunogenic peptide" or "peptide epitope" is a peptide which 
10 comprises an allele-speclfic motif or supermotif such that the peptide will bind an HLA 
molecule and induce a CTL and/or HTL response. Thus, immunogenic peptides of the 
invention are capable of binding to an appropriate HLA molecule and thereafter inducing 
a cytotoxic T cell response, or a helper T cell response, to the antigen fi-om which the 
immunogenic peptide is derived. 
15 A "protective immune response" refers to a CTL and/or an HTL response 

to an antigen derived from an infectious agent or a tumor antigen, which prevents or at 
least partially arrests disease symptoms or progression. The immune response may also 
include an antibody response which has been facihtated by the stimulation of helper T 
cells. 

20 The term "residue" refers to an amino acid or amino acid mimetic 

incorporated into an oligopeptide by an amide bond or amide bond mimetic. 

"Synthetic peptide" refers to a peptide that is not naturally occurring, but is 
man-made using such methods as chemical synthesis or recombinant DNA technology. 
The nomenclature used to describe peptide compounds follows the 

25 conventional practice wherein the amino group is presented to the left (the N-terminus) 
and the carboxyl group to the right (the C-terminus) of each amino acid residue. When 
amino acid residue positions are referred to in a peptide epitope they are numbered in an 
amino to carboxyl direction with position one being the position closest to the amino 
terminal end of the epitope, or the peptide or protein of which it may be a part. In the 

30 formulae representing selected specific embodiments of the present invention, the amino- 
and carboxyl-terminal groups, although not specifically shown, are in the form they 
would assume at physiologic pH values, unless otherwise specified. In the amino acid 
structure formulae, each residue is generally represented by standard three letter or single 
letter designations. The L-forra of an amino acid residue is represented by a capital single 
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letter or a capital firs: letter of a three-letter symbol, and the D-form for those amino acids 
having D-forms is represented by a lower case single letter or a lower case three letter 
symbol. Glycine has no asymmetric carbon atom and is simply referred to as "Gly" or G. 
As used herein, the term "expression vector" is intended to refer to a 
5 nucleic acid molecule capable of expressing an antigen of interest such as a MHC class I 
or class II epitope in an appropriate target cell. An expression vector can be, for example, 
a plasmid or vmis, including DNA or RNA viruses. The expression vector contains such 
a promoter element to express an antigen of interest in the appropriate cell or tissue in 
order to stimulate a desired immune response. 

10 

DETAILED DESCRIPTION OF THE INVENTION 
Cytotoxic T lymphocytes (CTLs) and helper T lymphocytes (HTLs) are 
critical for immunit>- against infectious pathogens; such as viruses, bacteria, and protozoa; 
tumor cells; autoimmunne diseases and the like. The present invention provides 
1 5 minigenes that encode peptide epitopes which induce a CTL and/or HTL response. The 
minigenes of the im ention also include an MHC targeting sequence. A variety of 
minigenes encoding different epitopes can be tested for immunogenicity using an HLA 
transgenic mouse. The epitopes are typically a combination of at least two or more HTL 
epitopes, or a CTL epitope plus a universal HTL epitope, and optinally include additional 
20 HTl and/or CTL epitopes. Two, three, four, five, six, seven, eight, nine, ten, twenty, 
thirty, forty or about fifty different epitopes, either HTL and/or CTL, can be included in 
the minigene, along with the MHC targeting sequence. The epitopes can have different 
HLA restriction. Epitopes to be tested include those derived fix)m viruses such as HIV, 
HBV, HCV, HSV, CMV, HPV, and HTLV; cancer antigens such as p53, Her2/Neu, 
25 MAGE, PSA, human papilloma virus, and CEA; parasites such as Trypanosoma, 

Plasmodium, Leishmania, Giardia, Entamoeba; autoimmune diseases such as rheumatoid 
arthritis, myesthenia gravis, and lupus erythematosus; ftmgi such as Aspergillus and 
Candida; and bacteria such as Escherichia coli. Staphylococci. Chlamydia. Mycobacteria. 
Streptococci, and Pseudomonas. The epitopes to be encoded by the minigene are selected 
30 and tested using the methods described in published PCT applications WO 93/07421, WO 
94/02353, WO 95/01000, WO 97/0445 1, and WO 97/05348, herein incorporated by 
reference. 
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HTL and CTL Epitopes 

The expression vectors of the invention encode one or more MHC class II 
and/or class I epitopes and an MHC targeting sequence. Multiple MHC class 11 or class I 
q)itopes present in an expression vector can be derived from the same antigen, or the 
MHC epitopes can be derived from different antigens. For example, an expression vector 
can contain one or more MHC epitopes that can be derived from two different antigens of 
the same virus or from two different antigens of different viruses. Furthermore, any 
MHC epitope can be used in the expression vectors of the invention. For example, any 
single MHC epitope or a combination of the MHC epitopes shown in Tables 1 to 8 can be 
used in the expression vectors of the invention. Other peptide epitopes can be selected by 
one of skill in the art, e.g., by using a computer to select epitopes that contain HLA allele- 
specific motifs or supermotifs. The expression vectors of the invention can also encode 
one or more universal MHC class II epitopes, e.g., PADRE (see, e.g., SEQ ID NO:38 and 
Table 8 of TTC docket No. 018623-006221, filed May 12, 1999. USSN 

___). 

Universal MHC class II epitopes can be advantageously combined with 
other MHC class I and class II epitopes to increase the number of cells that are activated 
in response to a given antigen and provide broader population coverage of MHC-reactive 
alleles. Thus, the expression vectors of the invention can encode MHC epitopes specific 
for an antigen, universal MHC class II epitopes, or a combination of specific MHC 
epitopes and at least one universal MHC class II epitope. 

MHC class I epitopes are generally about 5 to 15 amino acids in length, in 
particular about 8 to 1 1 amino acids in length. MHC class II epitopes are generally about 
10 to 25 amino acids in length, in particular about 13 to 21 amino acids in length. A 
MHC class I or II epitope can be derived from any desired antigen of interest. The 
antigen of interest can be a viral antigen, surface receptor, tumor antigen, oncogene, 
enzyme, or any pathogen, cell or molecule for which an immune response is desired. 
Epitopes can be selected based on their ability to bind one or multiple HLA alleles, and 
can also be selected using the "analog" technique described below. 

Targeting Sequences 

The expression vectors of the invention encode one or more MHC epitopes 
operably linked to a MHC targeting sequence. The use of a MHC targeting sequence 
enhances the immune response to an antigen, relative to delivery of antigen alone, by 
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directing the peptide epitope to the site of MHC molecule assembly and transport to the 
cell surface, thereby providing an increased number of MHC molecule-peptide epitope 
complexes available for binding to and activation of T cells. 

MHC class I targeting sequences are used in the present invention, e.g., 
those sequences that target an MHC class I epitope peptide to a cytosolic pathway or to 
the endoplasmic reticulum {see, e.g., Rammensee etai, Immunogenetics 41:178-228 
(1995)). For example, the cytosolic pathway processes endogenous antigens that are 
expressed inside the cell. Although not wishing to be bound by any particular theory, 
cytosolic proteins are thought to be at least partially degraded by an endopeptidase 
activity of a proteasome and then transported to the endoplasmic reticulum by the TAP 
molecule (transporter associated with processing). In the endoplasmic reticulum, the 
antigen binds to MHC class 1 molecules. Endoplasmic reticulum signal sequences bypass 
the cytosolic processing pathway and directly target endogenous antigens to the 
endoplasmic reticulum, where proteolytic degradation into peptide fragments occurs. 
Such MHC class I targeting sequences are well known in the art, and include, e.g., signal 
sequences such as those from Ig kappa ,tissue plasminogen activator or insulin. A 
preferred signal peptide is the human Ig kappa chain sequence. Endoplasmic reticulum 
signal sequences can also be used to target MHC class II epitopes to the endoplasmic 
reticulum, the site of MHC class I molecule assembly. 

MHC class II targeting sequences are also used in the invention, e.g., those 
that target a peptide lo the endocytic pathway. These targeting sequences typically direct 
extracellular antigens to enter the endocytic pathway, which results in the antigen being 
transferred to the lysosomal compartment where the antigen is proteolytically cleaved 
into antigen peptides for binding to MHC class 11 molecules. As with the normal 
processing of exogenous antigen, a sequence that directs a MHC class II epitope to the 
endosomes of the endocytic pathway and/or subsequently to lysosomes, where the MHC 
class n epitope can bmd to a MHC class II molecule, is a MHC class 11 targeting 
sequence. For example, group of MHC class II targeting sequences useful in the 
invention are lysosomal targeting sequences, which localize polypeptides to lysosomes. 
Since MHC class II molecules typically bind to antigen peptides derived from proteolytic 
processing of endocytosed antigens in lysosomes, a lysosomal targeting sequence can 
function as a MHC class II targeting sequence. Lysosomal targeting sequences are well 
known in the art and include sequences found in the lysosomal proteins LAMP-1 and 
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LAMP-2 as described by August et al (XJ.S. Patent No. 5,633,234, issued May 27, 1997), 
which is incorporated herein by reference. 

Other lysosomal proteins that contain lysosomal targeting sequences 
include HLA-DM. HLA-DM is an endosomal/lysosomal protein that functions in 
facilitating binding of antigen peptides to MHC class II molecules, Since it is located in 
the lysosome, HLA-DM has a lysosomal targeting sequence that can function as a MHC 
class II molecule targeting sequence (Copier et al, 1 Immunol 157:1017-1027 (1996), 
which is incorporated herem by reference). 

The resident lysosomal protein HLA-DO can also function as a lysosomal 
targeting sequence. In contrast to the above described resident lysosomal proteins 
LAMP-1 and HLA-DM, which encode specific Tyr-containing motifs that target proteins 
to lysosomes, HLA-DO is targeted to lysosomes by association with HLA-DM (Liljedahl 
et al. EMBO 1 1 5 :48 1 7-4824 ( 1 996)), which is incorporated herein by reference. 
Therefore, the sequences of HLA-DO that cause association with HLA-DM and, 
consequently, translocation of HLA-DO to lysosomes can be used as MHC class II 
targeting sequences. Similarly, the murine homolog of HLA-DO, H2-D0, can be used to 
"derive a MHC class II targeting sequence. A MHC class II epitope can be fused to HLA- 
DO or H2-D0 and targeted to lysosomes. 

In another example, the cytoplasmic domains of B cell receptor subunits 
Ig-a and Ig-P mediate antigen internalization and increase the efficiency of antigen 
presentation (Bonnerot et al. Immunity 3:335-347 (1995)), which is incorporated herein 
by reference. Therefore, the cytoplasmic domains of the Ig-a and Ig-P proteins can 
function as MHC class II targeting sequences that target a MHC class II epitope to the 
endocytic pathway for processing and binding to MHC class II molecules. 

Another example of a MHC class II targeting sequence that directs MHC 
class II epitopes to the endocj^ic pathway is a sequence that directs polypeptides to be 
secreted, where the polypeptide can enter the endosomal pathway. These MHC class U 
targeting sequences that direct polypeptides to be secreted mimic the normal pathway by 
which exogenous, extracellular antigens are processed into peptides that bind to MHC 
class II molecules. Any signal sequence that functions to direct a polypeptide through the 
endoplasmic reticulum and ultimately to be secreted can function as a MHC class II 
targeting sequence so long as the secreted polypeptide can enter the endosomal/lysosomal 
pathway and be cleaved into peptides that can bind to MHC class II molecules. An 



- 19 - 



wo 99/58658 



PCT/US99/10646 



example of such a fusion is shown in Figure 11, where the signal sequence of kappa 
immunoglobulin is fused to multiple MHC class II epitopes. 

In another example, the li protein binds to MHC class II molecules in the 
endoplasmic reticulum, where it functions to prevent peptides present in the endoplasmic 
5 reticulum from binding to the MHC class II molecules. Therefore, fusion of a MHC class 
II epitope to the li protein targets the MHC class II epitope to the endoplasmic reticulum 
and a MHC class II molecule. For example, the CLIP sequence of the li protein can be 
removed and replaced with a MHC class II epitope sequence so that the MHC class II 
epitope is directed to the endoplasmic reticulum, where the epitope binds to a MHC class 

10 II molecule, . . 

In some cases, antigens themselves can serve as MHC class II or I 
targeting sequences and can be fused to a universal MHC class II epitope to stimulate an 
immune response. Although cytoplasmic viral antigens are generally processed and 
presented as complexes with MHC class I molecules, long-lived cytoplasmic proteins 

15 such as the influenza matrix protein can enter the MHC class II molecule processing 

pathway (Gueguen & Long, Proa Natl Acad, Set USA_ 93:14692-14697 (1996)), which 
is incorporated herein by reference. Therefore, long-lived cytoplasmic proteins can 
function as a MHC class II targeting sequence. For example, an expression vector 
encoding influenza matrix protein fused to a universal MHC class II epitope can be 

20 advantageously used to target influenza antigen and the universal MHC class H epitope to 
the MHC class II pathway for stimulating an immune response to influenza. 

Other examples of antigens functioning as MHC class 11 targeting 
sequences include polypeptides that spontaneously form particles, the polypeptides are 
secreted from the cell that produces them and spontaneously form particles, which are 

25 taken up into an antigen-presenting cell by endocytosis such as receptor-mediated 

endocytosis or are engulfed by phagocytosis. The particles are proteolytically cleaved 
into antigen peptides after entering the endosomal/lysosomal pathway. 

One such polypeptide that spontaneously forms particles is HBV surface 
antigen (HBV-S) piminsky ei aL Vaccine 15:637-647 (1997); Le Borgne et aL 

30 Virology 240:304-3 1 5 (1 998)), each of which is incorporated herein by reference. 

Another polypeptide that spontaneously fonns particles is HBV core antigen (Kuhrober et 
al, International Immunol 9: 1203-1212 (1997)), which is incorporated herein by 
reference. Still another polypeptide that spontaneously forms particles is the yeast Ty 
protein (Weber et aL Vaccine 13:831-834 (1995)), which is incorporated herein by 
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reference. For example, an expression vector containing HBV-S antigen fused to a 
universal MHC class II epitope can be advantageously used to target HBV-S antigen and 
the universal MHC class n epitope to the MHC class II pathway for stimulating an 
immune response to HBV. 

5 

Binding Affinity of Peptide Epitopes for HLA Molecules 

The large degree of HLA polymorphism is an important factor to be taken 
into account with the epitope-based approach to vaccine development. To address this 
factor, epitope selection encompassing identification of peptides capable of binding at 
10 high or intermediate affinity to multiple HLA molecules is preferably utilized, most 
preferably these epitopes bind at high or intermediate affinity to two or more allele 
specific HLA molecules. 

CTL -inducing peptides of interest for vaccine compositions preferably 
include those that have a binding affinity for class I HLA molecules of less than 500 nM. 
15 HTL-inducing peptides preferably include those that have a binding affinity for class H 
HLA molecules of less than 1000 nM. For example, peptide binding is assessed by 
testing the capacity of a candidate peptide to bind to a purified HLA molecule in vitro. 
Peptides exhibiting high or intermediate affinity are then considered for further analysis. 
Selected peptides are tested on other members of the supertype family. In preferred 
20 embodiments, peptides that exhibit cross-reactive binding are then used in vaccines or in 
cellular screening analyses. 

Higher HLA binding affmity is typically correlated with greater 
immunogenicity. Greater immunogenicity can be manifested in several different ways. 
Immunogenicity corresponds to whether an immune response is elicited at all, and to the 
25 vigor of any particular response, as well as to the extent of a population in which a 

response is elicited. For example, a peptide might elicit an immune response in a diverse 
array of the population, yet in no instance produce a vigorous response. In accordance 
with these principles, close to 90% of high binding peptides have been found to be 
immunogenic, as contrasted with about 50% of the peptides which bind with intermediate 
30 affinity. Moreover, higher binding affinity peptides leads to more vigorous immunogenic 
responses. As a result, less peptide is required to elicit a similar biological effect if a high 
affinity binding peptide is used. Thus, in prefenred embodiments of the invention, high 
binding epitopes are particularly usefijl. 
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The relationship between binding affinity for HLA class I molecules and 
inimunogenicity of discrete peptide epitopes on bound antigens has been determined for 
the first time in the art by the present inventors. The correlation between binding affinity 
and immunogenicity was analyzed in two different experimental approaches (Sette et ai, 
1 Immunol 153:5586-5592 (1994)). In the first approach, the immunogenicity of 
potential epitopes ranging in HLA binding affinity over a 10,000-fold range was analyzed 
in HLA-A*020l transgenic mice. In the second approach, the antigenicity of 
approximately 100 different hepatitis B virus (HBV)-derived potential epitopes, all 
carrying A*0201 binding motifs, was assessed by using PBL (peripheral blood 
lymphocytes) from acute„hepatitis P^^ant to these approaches, it was 

deteraiined that an affinity threshold of approximately 500 nM (preferably 50 nM or less) 
determines the capacity of a peptide epitope to ehcit a CTL response. These data are true 
for class I binding affinity measurements for naturally processed peptides and for 
synthesized T cell epitopes. These data also indicate the important role of determinant 
selection in the shaping of T cell responses {see, e.g., Schaeffer etal Proc, Natl Acad 
ScL USA 86:4649-4653, 1989). 

An affinity threshold associated with immunogenicity in the context of 
HLA class II DR molecules has also been delineated {see, e,g,, Southwood et ai J. 
Imn^ andUSSN 60/087192, filed 5/29/98), In order to ^ ' 

define a biologically significant threshold of DR binding affinity, a database of the 
binding afFmities of 32 DR-restricted epitopes for their restricting element (i.e., the HLA 
molecule that binds the motif) was compiled. In approximately half of the cases (1 5 of 32 
epitopes), DR restriction was associated with high binding affinities, i.e. binding affinities 
of less than 100 nM. In the other half of the cases (16 of 32), DR restriction was 
associated with intermediate affinity (binding affinities in the 100-1000 nM range). In 
only one of 32 cases was DR restriction associated with an IC50 of 1000 nM or greater. 
Thus, 1000 nM can be defined as an affinity threshold associated with immunogenicity in 
the context of DR molecules. 

Peptide Epitope Binding Motifs and Supermotifs 

In the past few years evidence has accumulated to demonstrate that a large 
fraction of HLA class I and class II molecules can be classified into a relatively few 
supertypes, each characterized by largely overiapping peptide binding repertoires, and. 
consensus structures of the main peptide binding pockets. 
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For HLA molecule pocket analyses, the residues comprising the B and F 
pockets of HLA class I molecules as described in crystallographic studies were analyzed 
(Guo et al. Nature 360:364 (1992); Saper et aL J- MoL Biol. 219:277 (1991); Madden et 
al, Cell 75:693 (1993); Parham et aL, Immunol. Rev. 143:141 (1995)). In these analyses, 

5 residues 9, 45, 63, 66, 67. 70, and 99 were considered to make up the B pocket; and the B 
pocket was deemed to determine the specificity for the amino acid residue in the second 
position of peptide ligands. Similarly, residues 77, 80, 81, and 1 16 were considered to 
determine the specificity of the F pocket; the F pocket was deemed to determine the 
specificity for the C-terminal residue of a peptide ligand bound by the HLA class I 

10 molecule. 

Through the study of single amino acid substituted antigen analogs and the 
sequencing of endogenously bound, naturally processed peptides, critical residues 
required for allele-specific binding to HLA molecules have been identified. The presence 
of these residues correlates with binding affinity for HLA molecules. The identification 

15 of motifs and/or supermotifs that correlate with high and intermediate affinity binding is 
an important issue with respect to the identification of immunogenic peptide epitopes for 
the inclusion in a vaccine. Kast et aL (/ ImmunoL 152:3904-3912 (1994)) have shown 
that motif-bearing peptides account for 90% of the epitopes that bind to allele-specific 
HLA class I molecules. In this study all possible peptides of 9 amino acids in length and 

20 overiapping by eight amino acids (240 peptides), which cover the entire sequence of the 
E6 and E7 proteins of human papillomavirus type 16, were evaluated for binding to five 
allele-specific HLA molecules that are expressed at high firequency among different 
ethnic groups. This unbiased set of peptides allowed an evaluation of the predictive value 
of HLA class I motifs. From the set of 240 peptides, 22 peptides were identified that 

25 bound to an allele-specific HLA molecules with high or intermediate affinity. Of these 
22 peptides, 20, (i.e., 91%), were motif-bearing. Thus, this study demonstrates the value 
of motifs for the identification of peptide epitopes for inclusion in a vaccine: application 
of motif-based identification techniques eliminates screening of 90% of the potential 
epitopes in a target antigen protein sequence. 

30 Peptides of the present invention may also include epitopes that bind to 

MHC class II DR molecules. There is a significant difference between class I and class II 
HLA molecules. This difference corresponds to the fact that, although a stringent size 
restriction and motif position relative to the binding pocket exists for peptides that bind to 
class I molecules, a greater degree of heterogeneity in both size and binding firame 
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position of the motif, relative to the N and C termini of the peptide, exists for class II 
peptide ligands. 

This increased heterogeneity of HLA class 11 peptide ligands is due to the 
structure of the binding groove of the HLA class II molecule which, unlike its class I 
counterpart, is open at both ends, Crystallographic analysis of HLA class II DRB*010l- 
peptide complexes showed that the residues occupying position 1 and position 6 of 
peptides complexed with DRB*0101 engage two complementary pockets on the 
DRBa*0101 molecules, with the PI position corresponding to the most crucial anchor 
residue and the deepest hydrophobic pocket {see, e.g., Madden, Ann, Rev, Immunol 
13:587 (1995)). Other studies have also pointed to the P6 position as a crucial anchor 
residue for binding to various other DR molecules. 

Thus, peptides of the present invention are identified by any one of several 
HLA class I or II -specific amino acid motifs {see, e.g.. Tables Mil of USSN 09/226,775, 
and 09/239,043, herein incorporated by reference in their entirety). If the presence of the 
motif corresponds to the ability to bind several allele-specific HLA antigens it is referred 
to as a supermotif The allele-specific HLA molecules that bind to peptides that possess a 
particular amino acid supermotif are collectively referred to as an HLA "supertype." 

Immune Response-Stimulating Peptide Analogs 

In general, CTL and HTL responses are not directed against all possible 
epitopes. Rather, they are restricted to a few "inununodominant" determinants 
(Zinkemagel et al, Adv, Immunol 27:5159 (1979); Bennink et aL J. Exp, Med, 
168:1935-1939 (1988); Rawle et al, J. Immunol 146:3977-3984 (1991)). It has been 
recognized that immunodominance (Benacerraf et al, Science 175:273-279 (1972)) could 
be explained by either the ability of a given epitope to selectively bmd a particular HLA 
protein (determinant selection theory) (Vitiello et al, 1 Immunol 131:1635 (1983)); 
Rosenthal et aL Nature 267:156-158 (1977)), or being selectively recognized by the 
existing TCR (T cell receptor) specificity (repertoire theory) (Klein, Immunology, The 
Science of Self on self Discrimination, pp. 270-3 10 (1982)). It has been demonstrated that 
additional factors, mostly linked to processing events, can also play a key role in 
dictating, beyond strict immunogenicity, which of the many potential determinants will 
be presented as immunodominant (Sercarz et al, Annu, Rev. Immunol 1 1 
(1993)). 
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The concept of dominance and subdominance is relevant to 
immunotherapy of both infectious diseases and cancer. For example, in the course of 
chronic viral disease, recruitment of subdominant epitopes can be important for 
successful clearance of the infection, especially if dominant CTL or HTL specificities 
5 have been inactivated by functional tolerance, suppressioii, mutation of viruses and other 
mechanisms (Franco et al, Curr Opin. Immunol 7:524-531 (1995)). In the case of 
cancer and tumor antigens, CTLs recognizing at least some of the highest binding affinity 
peptides might be functionally inactivated. Lower binding affinity peptides are 
preferentially recognized at these times, and may therefore be preferred in therapeutic or 
10 prophylactic anti-cancer vaccines. 

In particular, it has been noted that a significant number of epitopes 
derived from known non-viral tumor associated antigens (TAA) bind HLA class I with 
intermediate affinity (IC50 in the 50-500 nM range). For example, it has been found that 
8 of 15 known TA.^ peptides recognized by tumor infiltrating lymphocytes (TIL) or CTL 
15 bound in the 50-500 nM range. (These data are in contrast with estimates that 90% of 
known viral antigens were bound by HLA class I molecules with IC50 of 50 nM or less, 
while only approximately 10% bound in the 50-500 nM range (Sette et al, 1 Immunol, 
153:558-5592 (1994)). In the cancer setting this phenomenon is probably due to 
elimination, or functional inhibition of the CTL recognizing several of the highest binding 
20 peptides, presumably because of T cell tolerization events. 

Without intending to be bound by theory, it is believed that because T cells 
to dominant epitopes may have been clonally deleted, selecting subdominant epitopes 
may allow extant T cells to be recruited, which will then lead to a therapeutic or 
prophylactic response. However, the binding of HLA molecules to subdominant epitopes 
25 is often less vigorous than to dominant ones. Accordingly, there is a need to be able to 
modulate the binding affinity of particular immunogenic epitopes for one or more HLA 
molecules, and thereby to modulate the immune response elicited by the peptide, for 
example to prepare analog peptides which elicit a more vigorous response. This ability 
would greatly enhance the usefulness of peptide-based vaccines and therapeutic agents. 
30 Thus, although peptides with suitable cross-reactivity among all alleles of 

a superfamily are identified by the screening procedures described above, cross-reactivity 
is not always as complete as possible, and in certain cases procedures to further increase 
cross-reactivity of peptides can be useful; moreover, such procedures can also be used to 
modify other propenies of the peptides such as binding affinity or peptide stability. 
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Having established the general rules that govern cross-reactivity of peptides for HLA 
alleles within a given motif or supermotif, modification (i.e., analoging) of the structure 
of peptides of particular interest in order to achieve broader (or otherwise modified) HLA 
binding capacity can be performed. More specifically, peptides which exhibit the 

5 broadest cross-reactivity patterns, can be produced in accordance with the teachings 
herein. The present concepts related to analog generation are set forth in greater detail in 
co-pending USSN 09/226,775. 

In brief, the strategy employed utilizes the motifs or supermotifs which 
correlate with binding to certain HLA class I and 11 molecules. The motifs or supermotifs 

10 are defined by having primary anchors, and in many cases secondary anchors (see Tables 
I-III of USSN 09/226,775). Analog peptides can be created by substituting amino acids 
residues at primary anchor, secondary anchor, or at primary and secondary anchor 
positions. Generally, analogs are made for peptides that already bear a motif or 
supermotif Preferred secondary anchor residues of supermotifs and motifs that have 

1 5 been defined for HLA class I and class H binding peptides are shown in Tables 11 and HI, 

respectively, of USSN 09/226,775. 

For a number of the motifs or supermotifs in accordance with the 

invention, residues are defined which are deleterious to binding to allele-specific HLA 
molecules or members of HLA supertypes that bind to the respective motif or supermotif 

20 {see Tables II and III of USSN 09/226,775). Accordingly, removal of such residues that 
are detrimental to binding can be performed in accordance with the methods described 
therein. For example, in the case of the A3 supertype, when all peptides that have such 
deleterious residues are removed firom the population of analyzed peptides, the incidence 
of cross-reactivity increases from 22% to 37% (I., Sidney et al, Hu, Immunol 45:79 

25 (1996)). Thus, one strategy to improve the cross-reactivity of peptides within a given 
supermotif is simply to delete one or more of the deleterious residues present within a 
peptide and substitute a small "neutral" residue such as Ala (that may not influence T cell 
recognition of the peptide). An enhanced likelihood of cross-reactivity is expected if, 
together with elimination of detrimental residues within a peptide, "preferred" residues 

30 associated with high affinity binding to an allele-specific HLA molecule or to multiple 
HLA molecules within a superfamily are inserted. 

To ensure that an analog peptide, when used as a vaccine, actually elicits a 
CTL response to the native epitope in vivo (or, in the case of class II epitopes, a failure to 
elicit helper T cells that cross-react with the wild type peptides), the analog peptide may 
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be used to immunize T cells in vitro fromlndividuals of the appropriate HLA allele. 
Thereafter, the immunized cells' capacity to induce lysis of wild type peptide sensitized 
target cells is evaluated. In both class I and class II systems it will be desirable to use as 
targets, cells that have been either infected or transfected with the appropriate genes to 
5 establish whether endogenously produced antigen is also recognized by the relevant T 
cells. 

Another embodiment of the invention is to create analogs of weak binding 
peptides, to thereby eii^ure adequate numbers of cross-reactive cellular binders. Class I 
peptides exhibiting binding affinities of 500-50000 nM, and carrying an acceptable but 
10 suboptimal primary anchor residue at one or both positions can be "fixed" by substituting 
preferred anchor residues in accordance with the respective supertype. The analog 
peptides can then be tested for crossbinding activity. 

Another embodiment for generating effective peptide analogs involves the 
substitution of residues that have an adverse impact on peptide stability or solubility in^ 
1 5 e.g., a Uquid environment. This substitution may occur at any position of the peptide 
epitope. For example, a cysteine (C) can be substituted out in favor of gamma-amino 
butyric acid. Due to its chemical nature, cysteine has the propensity to form "disulfide 
bridges and sufficiently alter the peptide structurally so as to reduce binding capacity. 
Substituting gamma-amino butyric acid for C not only alleviates this problem, but 
20 actually improves binding and crossbinding capability in certain instances (Sette et al, In: 
Persistent Viral Infections (Ahmed & Chen, eds., 1998)). Substitution of cysteine with 
gamma-amino butj^c acid may occur at any residue of a peptide epitope, i.e., at either 
anchor ornon-anchor positions. 

25 Expression Vectors and Construction of a Minigene 

The expression vectors of the invention contain at least one promoter 
element that is capable of expressing a transcription unit encoding the antigen of interest, 
for example, a MHC class I epitope or a MHC class II epitope and an MHC targeting 
sequence in the appropriate cells of an organism so that the antigen is expressed and 

30 targeted to the appropriate MHC molecule. For example, if the expression vector is 
administered to a mammal such as a human, a promoter element that functions in a 
human cell is incorporated into the expression vector. An example of an expression 
vector useful for expressing the MHC class II epitopes fused to MHC class II targeting 
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sequences and the MHC class I epitopes described herein is the pEP2 vector described in 
Example IV. 

This invention relies on routine techniques in the field of recombinant 
genetics. Basic texts disclosing the general methods of use in this invention include 
Sambrook et aL, Molecular Cloning, A Laboratory Manual (2nd ed. 1989); Kriegler, 
Gene Transfer and Expression: A Laboratory Manual (1990); and Current Protocols in 
Molecular Biology (Ausubel et aL, eds., 1994); Oligonucleotide Synthesis: A Practical 
Approach (Gait, ed., 1984); Kuijpers, Nucleic Acids Research 18(17):5197 (1994); 
Dueholm, J. Org Chem. 59:5767-5773 (1994); Methods in Molecular Biology, volume 
20 (Agrawal, ed,); and Tijssen, Laboratory Techniques in Biochemistry and Molecular 
Biology-Hybridization with Nucleic Acid Probes, e.g., Part I, chapter 2 "Overview of 
principles of hybridization and the strategy of nucleic acid probe assays" (1993)). 

The minigenes are comprised of two or many different epitopes {see, e.g., 
Tables 1-8). The nucleic acid encoding the epitopes are assembled in a minigene 
according to standard techniques. In general, the nucleic acid sequences encoding 
minigene epitopes are isolated using amplification techniques with oligonucleotide 
primers, or are chemically synthesized. Recombinant cloning techniques can also be used 
when appropriate. Oligonucleotide sequences are selected which either amplify (when 
using PGR to assemble the minigene) or encode (when using synthetic oligonucleotides to 
assemble the minigene) the desired epitopes. 

Amplification techniques using primers are typically used to amplify and 
isolate sequences encoding the epitopes of choice firom DNA or RNA (see U.S. Patents 
4,683,195 and 4,683,202; PCR Protocols: A Guide to Methods and Applications (Innis et 
ai, eds, 1990)). Methods such as polymerase chain reaction (PGR) and ligase chain 
reaction (LGR) can be used to amplify epitope nucleic acid sequences directly from 
mRNA, from cDNA, from genomic libraries or cDNA libraries. Restriction endonuclease 
sites can be incorporated into the primers. Minigenes amplified by the PGR reaction can 
be purified from agarose gels and cloned into an appropriate vector. 

Synthetic oligonucleotides can also be used to construct minigenes. This 
method is performed using a series of overlapping oligonucleotides, representing both the 
sense and non-sense strands of the gene. These DNA firagments are then annealed, 
ligated and cloned. Ohgonucleotides that are not commercially available can be 
chemically synthesized according to the solid phase phosphoramidite triester method first 
described by Beaucage & Caruthers, Tetrahedron Letts. 22:1859-1862 (1981), using an 
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automated synthesizer, as described in Van Devanter et. ai. Nucleic Acids Res. 12:6159- 
6168 (1984). Purification of oligonucleotides is by either native acrylamide gel 
electrophoresis or by anion-exchange HPLC as described in Pearson & Reanier, J. 
Chrom. 255:137-149 (1983). 

5 The epitopes of the minigene are typically subcloned into an expression 

vector that contains a strong promoter to direct transcription, as well as other regulatory 
sequences such as enhancers and polyadenylation sites. Suitable promoters are well 
known in the art and described, e.g., in Sambrook et al. and Ausubel et al. Eukaryotic 
expression systems for mammalian cells are well known in the art and are commercially 

1 0 available. Such promoter elements include, for example, cytomegalovirus (CMV), Rous 
sarcoma virus LTR and SV40. 

The expression vector typically contains a transcription unit or expression 
cassette that contains all the additional elements required for the expression of the 
minigene in host cells. A typical expression cassette thus contains a promoter operably 

1 5 linked to the minigene and signals required for efficient polyadenylation of the transcript. 
Additional elements of the cassette may include enhancers and introns with functional 

splice donor and acceptor sites. 

In addition to a promoter sequence, the expression cassette can also 

contain a transcription termination region downstream of the structural gene to provide 

20 for efficient termination. The termination region may be obtained from the same gene as 

the promoter sequence or may be obtained from different genes. 

The particular expression vector used to transport the genetic information 

into the cell is not particularly critical. Any of the conventional vectors used for 

expression in eukarv otic cells may be used. Expression vectors containing regulatory 

25 elements from eukaryotic viruses are typically used in eukaryotic expression vectors, e.g., 

SV40 vectors, papilloma virus vectors, and vectors derived from Epstein Bar virus. Other 

exemplary eukaryotic vectors include pMSG, pAV009/A+, pMTO10/A+, pMAMneo-5, 

baculovirus pDSVE, and any other vector allowing expression of proteins under the 

direction of the SV40 early promoter, SV40 later promoter, metallothionein promoter, 

30 murine mammary tumor virus promoter, Rous sarcoma viriis promoter, polyhedrin 

promoter, or other promoters shown effective for expression in eukaryotic cells. In one 

embodiment, the vector pEP2 is used in the present invention. 

Other elements that are typically included in expression vectors also 

include a replicon that functions in E. coli, a gene encoding anUbiotic resistance to permit 
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selection of bacteria that harbor recombinant plasmids, and unique restriction sites in 
nonessential regions of the plasmid to allow insertion of eukaryotic sequences. The 
particular antibiotic resistance gene chosen is not critical, any of the many resistance 
genes known in the art are suitable. The prokaryotic sequences are preferably chosen 
such that they do not interfere with the replication of the DNA in eukaryotic cells, if 
necessary. 



Administration //I Vivo 

The invention also provides methods for stimulating an immune response 
by administering an expression vector of the invention to ari iridividual. Administration 
of an expression vector of the invention for stimulating an immune response is 
advantageous because the expression vectors of the invention target MHC epitopes to 
MHC molecules, thus increasing the number of CTL and HTL activated by the antigens 
encoded by the expression vector. 

Initially, the expression vectors of the invention are screened in mouse to 
determine the expression vectors having optimal activity in stimulating a desired immune 
response. Initial smdies are therefore carried out, where possible, with mouse genes of 
the MHC targeting sequences. Methods of determining the activity of the expression 
vectors of the invention are well known in the art and include, for example, the uptake of 
^H-thymidine to measure T cell activation and the release of ^^Cr to measure CTL activity 
as described below in Examples II and in. Experiments similar to those described in 
Example IV are performed to determine the expression vectors having activity at 
stimulating an immune response. The expression vectors having activity are further 
tested in human. To circumvent potential adverse immunological responses to encoded 
mouse sequences, the expression vectors having activity are modified so that the MHC 
class n targeting sequences are derived from human genes. For example, substitution of 
the analogous regions of the human homologs of genes containing various MHC class II 
targeting sequences are substituted into the expression vectors of the invention. 
Examples of such human homologs of genes containing MHC class 11 targeting sequences 
are shown in Figures 12 to 17. Expression vectors containing human MHC class II 
targeting sequences, such as those described in Example I below, are tested for activity at 
stimulating an immune response in human. 

The invention also relates to pharmaceutical compositions comprising a 
pharmaceutically acceptable carrier and an expression vector of the invention. 
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Pharmaceutically acceptable carriers are well known in the art and include aqueous or 
non-aqueous solutions, suspensions and emulsions, including physiologically buffered 
saline, alcohol/aqueous solutions or other solvents or vehicles such as glycols, glycerol, 
oils such as olive oil or injectable organic esters. 
5 A pharmaceutically acceptable carrier can contain physiologically 

acceptable compounds that act, for example, to stabilize the expression vector or increase 
the absorption of the expression vector. Such physiologically acceptable compounds 
include, for example, carbohydrates, such as glucose, sucrose or dextrans, antioxidants 
such as ascorbic acid or glutathione, chelating agents, low molecular weight polypeptides, 
10 antimicrobial agents, inert gases or other stabilizers or excipients. Expression vectors can 
additionally be complexed with other components such as peptides, polypeptides and 
carbohydrates. Expression vectors can also be complexed to particles or beads that can 
be administered to an individual, for example, using a vaccine gun. One skilled in the art 
would know that the choice of a pharmaceutically acceptable carrier, including a 
15 physiologically acceptable compound, depends, for example, on the route of 
administration of the expression vector. 

The invention further relates to methods of administering a pharmaceutical 
composition comprising an expression vector of the invention to stimulate an immune 
response. The expression vectors are administered by methods well known in tlie art as 
20 described in Donnelly et al {Ann. Rev. Immunol 15:617-648 (1997)); Feigner et al (U.S. 
Patent No. 5,580,859, issued December 3, 1996); Feigner (U.S. Patent No. 5,703,055, 
issued December 30, 1997); and Carson et al (U.S. Patent No. 5,679,647, issued October 
21, 1997), each of which is incorporated herein by reference. In one embodiment, the 
minigene is administered as naked nucleic acid. 
25 A pharmaceutical composition comprising an expression vector of the 

invention can be administered to stimulate an immune response in a subject by various 
routes including, for example, orally, intravaginally, rectally, or parenterally, such as 
intravenously, intramuscularly, subcutaneously, intraorbitally, intracapsularly, 
intraperitoneally, intracistemally or by passive or facilitated absorption through the skin 
30 using, for example, a skin patch or transdermal iontophoresis, respectively. Furthermore, 
the composition can be administered by injection, intubation or topically, the latter of 
which can be passive, for example, by direct application of an ointment or powder, or 
active, for example, using a nasal spray or inhalant. An expression vector also can be 
administered as a topical spray, in which case one component of the composition is an 
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appropriate propellant. The phannaceutical composition also can be incorporated, if 
desired, into liposomes, microspheres or other polymer matrices (Feigner et al., U.S. 
Patent No. 5,703,055; Gregoriadis, Liposome Technology, Vols. I to HI (2nd ed. 1993), 
each of which is incorporated herein by reference). Liposomes, for example, which 
5 consist of phospholipids or other lipids, are nontoxic, physiologically acceptable and 
metabolizable carriers that are relatively simple to make and administer. 

The expression vectors of the invention can be delivered to the interstitial 
spaces of tissues of an ammal body (Feigner et al, U.S. Patent Nos. 5,580,859 and 
5,703,055). Administration of expression vectors of the invention to muscle is a 
10 particularly effective method of administration, including intradermal and subcutaneous 
injections and transdermal administration. Transdermal administration, such as by 
iontophoresis, is also an effective method to deliver expression vectors of the invention to 
muscle. Epidermal administration of expression vectors of the invention can also be 
employed. Epidermal administration involves mechanically or chemically irritating the 
1 5 outermost layer of epidermis to stimulate an immune response to the irritant (Carson et 
a/., U.S. PatMit No. 5,679,647). 

Other effective methods of administering an expression vector of the 
invention to stimulate an immune response include mucosal adminisfaration (Carson er a/., 
U.S. Patent No. 5,679,647). For mucosal administration, the most effective method of 
20 administration includes intranasal administration of an appropriate aerosol containing the 
expression vector and a pharmaceutical composition. Suppositories and topical 
preparations are also effective for delivery of expression vectors to mucosal tissues of 
genital, vaginal and ocular sites. Additionally, expression vectors can be complexed to 
particles and administered by a vaccine gun. 
25 The dosage to be administered is dependent on the method of 

administration and will generally be between about 0.1 \ig up to about 200 p.g. For 
example, the dosage can be from about 0.05 ng/kg to about 50 mg/kg, in particular about 
0.005-5 mg/kg. An effective dose can be determined, for example, by measuring the 
immune response after administration of an expression vector. For example, the 
30 production of antibodies specific for the MHC class H epitopes or MHC class I epitopes 
encoded by tiie expression vector can be measured by metiiods well known in the art, 
including ELISA or other immunological assays. In addition, the activation of T helper 
cells or a CTL response can be measured by methods well known in the art including, for 



- 32 - 



wo 99/58658 PCT/US99/10646 

example, the uptake of ^H-thymidine to measure T cell activation and the release of ^*Cr 
to measure CTL activity {see Examples II and III below). 

The pharmaceutical compositions comprising an expression vector of the 
invention can be administered to mammals, particularly humans, for prophylactic or 
therapeutic purposes. Examples of diseases that can be treated or prevented using the 
expression vectors of the invention include infection with HBV, HCV, HIV and CMV as 
well as prostate cancer, renal carcinoma, cervical carcinoma, lymphoma, condyloma 
acuminatum and acquired immunodeficiency syndrome (AIDS). 

In therapeutic applications, the expression vectors of the invention are 
administered to an individual akeady suffering from cancer, autoinimune disease or 
infected with a virus. Those in the incubation phase or acute phase of the disease can be 
treated with expression vectors of the invention, including those expressing all universal 
MHC class II epitopes, separately or in conjunction with other treatments, as appropriate. 

In therapeutic and prophylactic applications, pharmaceutical compositions 
comprising expression vectors of the invention are administered to a patient in an amount 
sufficient to elicit an effective immune response to an antigen and to ameliorate the signs 
or symptoms of a disease. The amount of expression vector to administer that is 
sufficient to ameliorate the signs or symptoms of a disease is termed a therapeutically 
effective dose. The amount of expression vector sufficient to achieve a therapeutically 
effective dose will depend on the pharmaceutical composition comprising an expression 
vector of the invention, the manner of administration, the state and severity of the disease 
being treated, the weight and general state of health of the patient and the judgment of the 
prescribing physician. 

All publications and patent applications cited in this specification are 
herein incorporated by reference as if each individual publication or patent application 
were specifically and individually indicated to be incorporated by reference. 

Although the foregoing invention has been described in some detail by 
way of illustration and example for purposes of clarity of understandmg, it will be readily 
apparent to one of ordinary skill in the art in Hght of the teachings of this invention that 
certain changes and modifications may be made thereto without departing from the spirit 
or scope of the appended claims. 
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EXAMPLES 

The following example is provided by way of illustration only and not by 
way of limitation. Those of skill in the art will readily recognize a variety of noncritical 
parameters that could be changed or modified to yield essentially similar results. 

5 

F.XAN^LE I: Construction of Expression Vecto rs Containing MHC Class II Epitopes 

This example shows construction of expression vectors containing MHC 
class II epitopes that can be used to target antigens to MHC class II molecules. 

Expression vectors comprising DNA constructs were prepared using 
10 overlapping oligonucleotides, polymerase chain reaction (PCR) and standard molecular 
biology techniques (Dieffenbach & Dveksler, PCR Primer: A Laboratory Manual (1995); 
Sambrook et al. Molecular Cloning: A Laboratory Manual (2nd ed., 1989), each of 
which is incorporated herein by reference). 

To generate full length wild type li, the full length invariant chain was 
1 5 amplified, cloned, and sequenced and used in the construction of the three invariant chain 
constructs. Except wher? noted, the source of cDNA for all the constructs listed below 
was Mouse Spleen Marathon-Ready cDNAmade from Balb/c males (Clontech; Palo Alto 
CA). The primer p.airs were the oligonucleotide 

GCTAGCGCCGCCACCATGGATGACCAACGCGACCTC (SEQ ID NO:40), which is 
20 designated murli-F and contains an Nhel site followed by the consensus Kozak sequence 
and the 5' end of the li cDNA; and the oligonucleotide 

GGTACCTCACAGGGTGACTTGACCCAG (SEQ ID N0:41), which is designated 
murli-R and contains a Kpnl site and the 3' end of the li coding sequence. 

For the PCR reaction, 5 ]i\ of spleen cDNA and 250 nM of each primer 

25 were combined in a 1 00 ^il reaction with 0.25 mM each dNTP and 2.5 units of Pfu 

polymerase in Pfu polymerase buffer containing 10 mM KCl, 10 raM (NH4)2S04, 20 mM 
Tris-chloride, pH 8.75, 2 mM MgS04, 0.1% TRITON X-100 and 100 ng/ml bovine serum 
albumin (BSA). A Perkin/Ehner 9600 PCR machine (Perkin Etaier; Foster City CA) was 
used and the cycling conditions were: 1 cycle of 95°C for 5 minutes, followed by 30 

30 cycles of 95°C for 1 5 seconds. 52''C for 30 seconds, and 72°C for 1 minute. The PCR 
reaction was run on a 1 % agarose gel, and the 670 base pair product was cut out, purified 
by spinning through a Millipore Ultrafree-MC filter (MilUpore; Bedford MA) and cloned 
into pCR-Blunt from Invitrogen (San Diego, CA). Individual clones were screened by 
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sequencing, and a correct clone (named bli#3) was used as a template for the helper 
constmcts. 

DNA constructs containing pan DR epitope sequences and MHC II 
targeting sequences derived from the li protein were prepared. The li murine protein has 
been previously described (Zhu & Jones, Nucleic Acids Res. 17:447-448 (1989)), which is 
incorporated herein by reference. Briefly, the liPADRE construct contains the full length 
li sequence with PADRE precisely replacing the CLIP region. The DNA construct 
encodes amino acids 1 through 87 of invariant chain, followed with the 13 amino acid 
PADRE sequence (SEQ ID NO:38) and the rest of the invariant chain DNA sequence 
(amino acids 101-215). The construct was amplified in 2 overiapping halves that were 
joined to produce the final construct. The two primers used to ampUfy the 5* half were 
murli-F and the oligonucleotide 

CAGGGTCCAGGCAGCCACGAACTTGGCC ACAGGTTTGGCAGA (SEQ ID 
NO:42), which is designated liPADRE-R. The liPADRE-R primer includes nucleotides 
303-262 of liPADRE. The 3' half was amplified with the primer 
GGCTGCCTGGACCCTGAAGGCTGCCGCTATGTCCATGGATAAC (SEQ ID 
NO:43), which is designated liPADRE-F and includes nucleotides 288-330 of liPADRE; 
and murli-R. The PGR conditions were the same as described above, and the two halves 
were isolated by agarose gel electrophoresis as described above. 

Ten microliters of each PGR product was combined in a 100 fil PGR 
reaction with an annealing temperature of 50°C for five cycles to generate a full length 
template. Primers murli-F and murli-R were added and 25 more cycles carried out. The 
fiill length liPADRE product was isolated, cloned, and sequenced as described above. 
This construct contains the murine li gene with a pan DR epitope sequence substituted for 
the CLIP sequence of li (Figure 1). 

A DNA construct, designated I80T, containing the cytoplasmic domain, the 
transmembrane domain and part of the luminal domain of li fused to a string of multiple 
MHC class II epitopes was constructed (Figure 2). Briefly, the string of multiple MHC 
class II epitopes was constructed with three overiapping oligonucleotides (ohgos). Each 
ohgo overlapped its neighbor by 15 nucleotides and the fmal MHC class 11 epitope string 
was assembled by extending the overlapping ohgonucleotides in tiiree sets of reactions 
using PGR. The three oligonucleotides were: ohgo 1, nucleotides 241-310, 
CTTCGCATGAAGCTTATCAGCCAGGCTGTGCACGCCGCTCACGCCGAAATCAA 

CGAAGCTGGAAGAACCC (SEQ ID NO:44); 
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oligo 2, nucleotides 364-295, 
TTCTGGTCAGCAGAAAGAACAGGATAGGAGCGTTTGGAGGGCGATAAGCTGG 

AGGGGTTCTTCCAGCTTC (SEQ ID NO:45); and 
oligo 3, nucleotides 350-42, 
5 TTCTGCTGACCAGAATCCTGACAATCCCCCAGTCCCTGGACGCCAAGTTCGTG 

GCTGCCTGGACCCTGAAG (SEQ ID NO:46). 

For the first PGR reaction, 5 |ig of oligos 1 and 2 were combined in a 100 
^1 reaction containing P/w polymerase. APerkin/Elmer 9600 PGR machine was used and 
the annealing temperature used was 45° C. The PGR product was gel-purified, and a 
10 .. .. second reaction containing the PGR product of oligos 1 and.2 with oligo 3 was annealed . 
and extended for 10 cycles before gel purification of the fiiil length product to be used as 
a "mega-primer." 

The I80T construct was made by amplifying bli#3 with murli-F and the 
mega-primer. The cycling conditions were: 1 cycle of 95°C for 5 minutes, followed by 5 
15 cycles of 95°C for 15 seconds, 37''C for 30 seconds, and 72°C for 1 minute. Primer Help- 
epR was added and an additional 25 cycles were carried out with the anneahng 
temperature raised to 47°C. The Help-epR primer 

GGTACCTCAAGCGGCAGCCTTCAGGGTCCAGGCA (SEQ ID NO:47) corresponds 
to nucleotides 438-405. The fiill length I80T product was isolated, cloned, and sequenced 
20 as above. 

The I80T construct (Figure 2) encodes amino acid residues 1 through 80 of 
li, containing the cytoplasmic domain, the transmembrane domain and part of the luminal 
domain, fiised to a string of multiple MHO class II epitopes corresponding to: amino acid 
residues 323-339 of ovalbumm 
25 (IleSerGhiAlaValHisAlaAlaHisAlaGluIleAsnGluAlaGlyArg; SEQ ID NO:48); amino 
acid residues 128 to l4l of HBV core antigen (amino acids 

ThrProProAlaTyrArgProProAsnAlaProIleLeu; SEQ ID NO:49); amino acid residues 182 
to 196 of HBV env (amino acids PhePheLeuLeuThrArglleLeuThrlleProGlnSerLeuAsp; 
SEQ ID NO:50); and the pan DR sequence designated SEQ ID NO:38. 
30 A DNA construct containing the cytoplasmic domain, transmembrane 

domain and a portion of the luminal domain of li fused to the MHC class II epitope string 
shown in Figure 2 and amino acid residues 101 to 215 of li encoding the trimerization 
region of li was generated (Figure 3). This construct, designated liThfull, encodes the 
first 80 amino acids of invariant chain followed by the MHC class II epitope string 
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(replacing CLIP) and the rest of the invariant chain (amino acids 101-215). Briefly, the 
construct was generated as two overlapping halves that were annealed and extended by 
PGR to yield the final product. 

The 5' end of liThfiill was made by amplifying I80T with murli-F (SEQ 
ID NO:40) and Th-Pad-R. The Th-Pad-R primer AGCGGCAGCCTTCAGGGTC (SEQ 
ID N0:51) corresponds to nucleotides 429-411. The 3' half was made by amplifying 
bli#3 with liPADRE-F and murli-R (SEQ ID N0:41). The liPADRE-F primer 
GGCTGCCTGGACCCTGAAGGCTGCCGCTATGTCCATGGATAAC (SEQ ED NO:52) 
corresponds to nucleotides 402-444. Each PGR product was gel purified and mixed, then 
denatured, annealed, and extended by five cycles of PGR. Primers murli-F (SEQ ID 
NO:40) and murli-R (SEQ ID N0:41) were added and another 25 cycles performed. The 
fill! length product was gel purified, cloned, and sequenced. 

All of the remaining constructs described below were made essentially 
according to the scheme shown in Figure 18. Briefly, primer pairs IF plus IR, designated 
below for each specific construct, were used to amplify the specific signal sequence and 
contained an overiapping 15 base pair tail identical to the 5' end of the MHC class II 
epitope string. Primer pair Th-ova.F,ATGAGCCAGGCTGTGCACGC (SEQ ID n6:53), 
plus Th-Pad-R (SEQ ID N0:51) were used to amplify the MHC class II epitope string. A 
15 base pair overlap and the specific transmembrane and cytoplasmic tail containing the 
targeting signals were amplified with primer pairs 2F plus 2R. 

All three pieces of each cDNA were amplified using the following 
conditions: 1 cycle of 95T for 5 minutes, followed by 30 cycles of 95°C for 15 seconds, 
52^C for 30 seconds, and 72°C for 1 minute. Each of the three fragments was agrose-gel 
purified, and the signal sequence and MHG class II string firagments were combined and 
joined by five cycles in a second PGR. After five cycles, primers IF and Th-Pad-R were 
added for 25 additional cycles and the PGR product was gel purified. This signal 
sequence plus MHC class II epitope string fragment was combined with the 
transmembrane plus cytoplasmic tail fragment for the final PGR. After five cycles, 
primers IF plus 2R u ere added for 25 additional cycles and the product was gel purified, 
cloned and sequenced. 

A DNA construct containing the murine immunoglobulin kappa signal 
sequence fused to the T helper epitope string shown in Figure 2 and the transmembrane 
and cytoplasmic domains of LAMP-1 was generated (Figure 4) (Granger et aL Biol 
Chem, 265:12036-12043 (1990)), which is incorporated by reference (mouse LAMP-l 
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GenBank accession No. M32015). This construct, designated kappaLAMP-Th, contains 
the consensus mouse immunoglobulin kappa signal sequence and was amplified from a 
plasmid containing full length immunoglobulin kappa as depicted in Figure 18. The 
primer IF used was the oligonucleotide designated KappaSig-F, 
5 GCTAGCGCCGCCACCATGGGAATGCAG (SEQ ID NO:54). 

The primer IR used was the ohgonucleotide designated Kappa-Th-R, 
CACAGCCTGGCTGATTCCTCTGGACCC (SEQ ID NO:55). 

The primer 2F used was the ohgonucleotide designated PAD/LAMP-F, 
CTGAAGGCTGCCGCTAACAACATGTTGATCCCC (SEQ ID NO:56). The primer 2R 
10 used was the oligonucleotide designated LAMP-CYTOR, 
GGTACCCTAGATGGTCTGATAGCC (SEQ ID NO:57). 

ADN.A construct containing the signal sequence of H2-M fiised to the 
MHC class II epitope string shown in Figure 2 and the transmembrane and cytoplasmic 
domains of H2-M was generated (Figure 5). The mouse H2-M gene has been described 
15 previously, Peleraux et al, Immunogenetics 43:204-214 (1996)), which is incorporated 
heron by reference. This construct was designated H2M-Th and was constructed as 
depicted in Figure 18. The primer IF used was the ohgonucleotide designated H2-Mb- 
IF, GCC GCT AGC GCC GCC ACC ATG GCT GCA CTC TGG (SEQ ID NO:58). The 
primer IR used was the ohgonucleotide designated H2-Mb-1R, CAC AGC CTG GCT 
20 GAT CCC CAT ACA GTG CAG (SEQ ID NO:59). The primer 2F used was the 

ohgonucleotide designated H2.Mb-2F, CTG AAG GCT GCC GCT AAG GTC TCT GTG 
TCT (SEQ ED NO:60). The primer 2R used was the oligonucleotide designated H2-Mb- 
2R. GCG GGT ACC CTAATG CCG TCC TTC (SEQ ID N0:61). 

A DNA construct containing the signal sequence of H2-D0 fused to the 
25 MHC class 11 epitope string shown in Figure 2 and the transmembrane and cytoplasmic 
domains of H2-D0 was generated (Figure 6). The mouse H2-D0 gene has been 
described previously (Larhammar et al. J. Biol. Chem. 260:14111-14119 (1985)), which 
is incorporated herein by reference (GenBank accession No. Ml 9423). This construct, 
designated H20-Th, was constructed as depicted in Figure 18. The primer IF used was 
30 the oligonucleotide designated H2-0b-lF, GCG GCT AGC GCC GCC ACC ATG GGC 
GCT GGG AGG (SEQ ID NO:62). The primer IR used was the ohgonucleotide 
designated H2-0b-lR, TGC ACA GCC TGG CTG ATG GAA TCC AGC CTC (SEQ ED 
NO:63). The primer 2F used was the oligonucleotide designated H2-Ob-2F, CTG AAG 
GCT GCC GCT ATA CTG AGT GGA GCT (SEQ ID NO:64). The primer 2R used was 
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the oligonucleotide designated H2-Ob-2R, GCC GGT ACC TCA TOT GAC ATG TCC 

CG (SEQ ID NO:65). 

A DNA construct containing a pan DR epitope sequence (SEQ ID NO:38) 
fused to the amino-tenninus of influenza matrix protein is generated (Figure 7). This 
construct, designated PADRE-Influenza matrix, contains the universal MHC class II 
epitope PADRE attached to the amino terminus of the influenza matrix coding sequence. 
The construct is made using a long primer on the 5' end primer. The 5' primer is the 
oligonucleotide 

GCTAGCGCCGCCACCATGGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGCTGC 
CGCTATGAGTCTTCTAACCGAGGTCGA (SEQ ID NO:66). The 3 ' primer is the 
oligonucleotide TC ACTTGAATCGCTGCATCTGC ACCCCC AT (SEQ ID NO:67). 
Influenza virus from the America Type Tissue Collection (ATCC) is used as a source for 
the matrix coding region (Perdue et al. Science 279:393-396 (1998)), which is 
incorporated herein by reference (GenBank accession No. AF0363 58). 

A DNA construct containing a pan DR epitope sequence (SEQ ID N0:3 8) 
fused to the amino-terminys of HBV-S antigen was generated (Figure 8). This construct 
is designated PADRE-HBV-s and was generated by annealing two overlapping 
oligpnucleptides to add PADKE onto the amino terminus of hepatitis B surface antigen 
(Michel era/.. Proc. Natl. Acad. Sci. USA 81:7708-7712 (1984); Michel et al, Proc. Natl. 
Acad. Sci. USA 92:5307-5311 (1995)), each of which is incorporated herein by reference. 
One oligonucleotide was 

GCTAGCGCCGCCACCATGGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGCTGC 
CGCTC (SEQ ID NO:68). The second oligonucleotide was 

CTCGAGAGCGGCAGCCTTCAGGGTCCAGGCAGCCACGAACTTGGCCATGGTG 
GCGGCG (SEQ ID NO:69). When annealed, the oligos have Nhel and Xhol cohesive 
ends. The oligos were heated to 100°C and slowly cooled to room temperature to anneal. 
A three part ligation joined PADRE with an Xhol-Kpnl fragment containing HBV-s 
antigen into the Nhel plus Kpnl sites of the expression vector. 

A DNA construct containing the signal sequence of Ig-a fused to the MHC 
class li epitope string shown in Figure 2 and the transmembrane and cytoplasmic domains 
of Ig-a was generated (Figure 9). The mouse Ig-a gene has been described previously 
(Kashiwamura et al, J. Immunol 145:337-343 (1990)), which is incorporated herein by 
reference (GenBank accession No. M31773). This construct, designated Ig-alphaTh, was 
constructed as depicted in Figure 18. The primer IF used was the oligonucleotide 
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designated Ig alpha-lF, GCG GCT AGC GCC GCC ACC ATG CCA GGG GGT CTA 
(SEQ ID NO:70). The primer IR used was the oligonucleotide designated Igalpha-IR, 
GCA CAG CCT GGC TGA TGG CCT GGC ATC CGG (SEQ ID N0:7 1). The primer 2F 
used was the oligonucleotide designated Igalpha-2F, CTG AAG GCT GCC GCT GGG 
5 ATC ATC TTG CTG (SEQ ID NO:72). The primer 2R used was the oligonucleotide 
designated Igalpha-2R, GCG GGT ACC TC A TGG CTT TTC CAG CTG (SEQ ID 
NO:73). 

ADNA construct containing the signal sequence of Ig-P fused to the MHC 
class II string shown in Figure 2 and the transmembrane and cytoplasmic domains of IgP 

10 was generated (Figure 10). The Ig-p sequence is the B29 gene of mouse and has been 
described previously (Hermanson et al. Proc, Natl Acad, ScL USA 85:6890-6894 
(1988)), which is incorporated herein by reference (GenBank accession No. J03857). 
This construct, designated Ig-betaTh, was constructed as depicted in Figure 18. The 
primer IF used was the oHgonucleotide designated B29-1F (33mer) GCG GCT AGC 

15 GCC GCC ACC ATG GCC ACA CTG GTG (SEQ ID NO:74). The primer IR used was 
the oligonucleotide designated B29-1R (30mer) CAC AGC CTG GCT GAT CGG CTC 
ACC TGA GAA (SEQ ID NO:75). The primer 2F used was the oligonucleotide 
designated B292F (30mer) CTG AAG GCT GCC GCT ATT ATC TTG ATC CAG (SEQ 
ID NO: 76). The primer 2R used was the oligonucleotide designated B29-2R (27mer), 

20 GCC GGT ACC TCA TTC CTG GCC TGG ATG (SEQ ID NO:77). 

ADNA constmct containing the signal sequence of the kappa 
immunoglobulin signal sequence fused to the MHC class II epitope string shown in 
Figure 2 was constructed (Figure 11). This construct is designated SigTh and was 
generated by using the kappaLAMP-Th construct (shown in Figure 4) and amplifying 

25 with the primer pair KappaSig-F (SEQ ID NO:54) plus Help-epR (SEQ ID NO:47) to 
create SigTh. SigTh contains the kappa immunoglobulin signal sequence fused to the T 
helper epitope string and terminated with a translational stop codon. 

Constructs encoding human sequences corresponding to the above 
described constructs having mouse sequences are prepared by substituting human 

30 sequences for the mouse sequences. Briefly, for the liPADRE construct, corresponding to 
Figure 1, amino acid residues 1-80 from the human li gene HLA-DR sequence (Figure 
12) (GenBank accession No. X00497 M14765) is substituted for the mouse li sequences, 
which is fused to PADRE, followed by human invariant chain HLA-DR amino acid 
residues 1 14-223. For the I80T construct, corresponding to Figure 2, amino acid residues 
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1-80 from the human sequence of li is followed by a MHC class II epitope string. For the 
liThfulI construct, corresponding to Figure 3, amino acid residues 1-80 from the human 
sequence of li, which is fused to a MHC class II epitope string, is followed by human 
invariant chain amino acid residues 114-223. 

5 For the LAMP-Th construct, similar to Figure 4, the signal sequence 

encoded by amino acid residues 1-19 (nucleotides 11-67) of human LAMP-l (Figure 13) 
(GenBank accession No. J04182), which is fused to the MHC class II epitope string, is 
followed by the transmembrane (nucleotides 1 163-1213) and cytoplasmic tail 
(nucleotides 1214-1258) region encoded by amino acid residues 380-416 of human 

10 LAMP-l. 

For the HLA-DM-Th construct, corresponding to Figure 5, the signal 
sequence encoded by amino acid residues 1-17 (nucleotides 1-51) of human HLA-DMB 
(Figure 14) (GenBank accession No. U15085), which is fused to the MHC class II epitope 
string, is followed by the transmembrane (nucleotides 646-720) and cytoplasmic tail 
15 (nucleotides 721-792) region encoded by amino acid residues 216-263 of human HLA- 
DMB. 

For the HLA-DO-Th construct, corresponding to Figure 6, the signal 
sequence encoded by amino acid residues 1-21 (nucleotides 1-63) of human HLA-DO 
(Figure 1 5) (GenBank accession No. L29472 J02736 N00052), which is fused to the 

20 MHC class II epitope siring, is followed by the transmembrane (nucleotides 685-735) and 
cytoplasmic tail (nucleotides 736-819) region encoded by amino acid residues 223-273 of 
human HLA-DO. 

For the Ig-alphaTh constmct, corresponding to Figure 9, the signal 
sequence encoded by amino acid residues 1-29 (nucleotides 1-87) of human Ig-a MB-1 

25 (Figure 1 6) (GenBank accession No. U05259), which is fused to the MHC class II epitope 
string, is followed by the transmembrane (nucleotides 424-498) and cytoplasmic tail 
(nucleotides 499-678) region encoded by amino acid residues 142-226 of human Ig-a 
MB-1. 

For the Ig-betaTh construct, corresponding to Figure 10, the signal 
30 sequence encoded by amino acid residues 1-28 (nucleotides 17-100) of human Ig-p B29 
(Figure 17) (GenBank accession No. M80461), which is fused to the MHC class II 
epitope string, is followed by the transmembrane (nucleotides 500-547) and cytoplasmic 
tail (nucleotides 548-703) region encoded by amino acid residues 156-229 of human Ig-p. 
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The SigTh construct shown in Figure 1 1 can be used in mouse and human. 
Alternatively, a signal sequence derived from an appropriate human gene containing a 
signal sequence can be substituted for the mouse kappa inmiunoglobulin sequence in the 
Sig Th construct. 

The PADRE-Influenza matrix construct shown in Figure 7 and the 
PADRE-HBVs construct shown in Figure 8 can be used in mouse and human. 

Some of the DNA constructs described above were cloned into the vector 
pEP2 (Figure 19; SEQ ID NO:35). The pEP2 vector was constructed to contain dual 
CMV' promoters. The pEP2 vector used the backbone of pcDNA3.1(-)Myc-His A from 
Invitrogen and pIRESlhyg from Clontech. Changes were made to both vectors before the 
CMV transcription unit from pIRES Ihyg was moved into the modified pcDNA vector. 

The pcDNA3.1(-)Myc-His A vector (http://www.invitrogen,com) was 
modified. Briefly, the PvuII fragment (nucleotides 1342-3508) was deleted. ABspHI 
fragment that contains the Ampicillin resistance gene (nucleotides 4404-5412) was cut 
out. The Ampicillin resistance gene was replaced with the kanamycin resistance gene 
from.pUC4K (GenBank Accession #X06404). pUC4K was amplified with the primer set: 
TCTGATGTTACATTGCACAAG (SEQ ID NO:78) (nucleotides 1621-1601) and 
GCGCACTCATGATGCTCTGCCAGTGTTACAACC (SEQ ID NO:79) (nucleotides 
682-702 plus the addition of a BspHI restriction site on the 5 ' end). The PGR product 
was digested with Bspffl and ligated into the vector digested with BspHI. The region 
between the Pmel site at nucleotide 905 and the EcoRV site at nucleotide 947 was 
deleted. The vector was then digested with Pmel (cuts at nucleotide 1076) and Apal (cuts 
at nucleotide 1004), Klenow filled in at the cohesive ends and ligated. The Kpnl site at 
nucleotide 994 was deleted by digesting with Kpnl and filling in the ends with Klenow 
DNA polymerase, and ligating. The intron A sequence from CMV (GenBank accession 
M21295, nucleotides 635-1461) was added by amplifying CMV DNA with the primer set: 
GCGTCTAGAGTAAGTACCGCCTATAGACTC (SEQ ID NO:80) (nucleotides 635-655 
plus an Xbal site on the 5' end) and CCGGCTAGCCTGCAGAAAAGACCCATGGAA 
(SEQ ID N0:81) (nucleotides 1461-1441 plus anNhel site on the 3' end). The PGR 
product was digested with Xbal and Nhel and ligated into the Nhel site of the vector 
(nucleotide 895 of the original pcDNA vector) so that the Nhel site was on the 3' end of 
the intron. 

To modify the pIRESlhyg vector (GenBank Accession U89672, 
Clontech), the Kpnl site (nucleotide 911) was deleted by cutting and filling in with 
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Klenow. The plasmid was cut with NotI (nucleotide 1254) and Xbal (nucleotide 3 196) 
and a polylinker oligo was inserted into the site. The polylinker was formed by annealing 
the following two oligos: 

GGCCGCAAGGAAAAAATCTAGAGTCGGCCATAGACTAATGCCGGTACCG(SEQ 
IDNO:82) and 

CTAGCGGTACCGGCATTAGTCTATGGCCCGACTCTAGATTTTTTCCTTGC(SEQ 
ED NO:83). The resulting plasmid was cut with Hindi and the fragment between Hindi 
sites 234 and 3538 was isolated and ligated into the modified pcDNA vector. This 
fragment contains a CMV promoter, intron, polylinker, and polyadenylation signal. 

The pIREShyg piece and the pcDNA piece were combined to form p£P2. 
The modified pcDNA3,l(-)Myc-His A vector was partially digested with PvuII to isolate 
a linear fragment with the cut downstream of the pcDNA polyadenylation signal (the 
other PvuII site is the CMV intron). The Hindi fragment from the modified pIRESlhyg 
vector was ligated into the PvuII cut vector. The polyadenylation signal from the pcDNA 
derived transcription unit was deleted by digesting with EcoRI (pcDNA nucleotide 955) 
and Xhol (pIRESlhyg nucleotide 3472) and replaced with a synthetic polyadenylation 
sequence. The synthetic polyadenylation signal was described in Levitt et al, Genes and 
Development yAm'\(yiS{\9Z9)). 

Two oligos were annealed to produce a fragment that contained a 
polylinker and polyadenylation signal with EcoRI and Xhol cohesive ends. The oligos 
were: 

AATTCGGATATCCAAGCTTGATGAATAAAAGATCAGAGCTCTAGTGATCTGTGT 
GTTGGTTTTTTTGTGTGC (SEQ ID NO:84) and 

TCGAGCACACAAAAAACCAACACACAGATCACTAGAGCTCTGATCTTTTTATT 

CATCAAGCTTGGATATCCG (SEQ ID NO:85). 

The resulting vector is named pEP2 and contains two separate 
transcription units. Both transcription units use the same CMV promoter but each 
contains different intron, polylinker, and polyadenylation sequences. 

The pEP2 vector contains two transcription units. The first transcription 
unit contains the CMV promoter initially from pcDNA (nucleotides 210-862 in Figure 
19), CMV intron A sequence (nucleotides 900-1728 in Figure 1 9), polylinker cloning site 
(nucleotides 1740-1760 in Figure 19) and synthetic polyadenylation signal (nucleotides 
1764-1769 in Figure 19). The second transcription unit, which was initially derived from 
pIRESlhyg, contains the CMV promoter (nucleotides 3165-2493 iii Figure 19), intron 
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sequence (nucleotides 2464-2173 in Figure 19), polylinker clone site (nucleotides 2126- 
2095 in Figure 19) and bovine growth hormone polyadenylation signal (nucleotides 1979- 
1974 in Figure 19). The kanamycin resistance gene is encoded in nucleotides 4965-4061 
(Figure 19). 

The DNA constructs described above were digested with Nhel and Kpnl 
and cloned into the Xbal and Kpnl sites of pEP2 (the second transcription unit). 

Additional vectors were also constructed. To test for the effect of co- 
expression of MHC class I epitopes with MHC class 11 epitopes, an insert was generated, 
designated AOS, that contains nine MHC class I epitopes. The AOS insert was initially 
constructed in the vector pMIN.O (Figure 20; SEQ ID NO:36). Briefly, the AOS insert 
contains nine MHC class I epitopes, six restricted by HLA-A2 and three restricted by 
HLA-AU, and the universal MHC class II epitope PADRE. The vector pMIN.O contains 
epitopes from HBV, HIV and a mouse ovalbumin epitope. The MHC class I epitopes 
appear in pMIN.O in the following order: 

consensus mouse Ig Kappa signal sequence (pMIN.O amino acid residues 
1-20, nucleotides 16-81) MQVQIQSLFLLLLWYPQSRG (SEQ ID Np.:86) encodeciby 
nucleotides ATG CAG GTG CAG ATC CAG AGC CTG TTT CTG CTC CTC CTG TGG 
GTG CCC GGGTCC AGAGGA (SEQ ID. NO:87); . 

HBV pol 149-1 59 (All restricted) 

(pMIN.O amino acid residues 21-31, nucleotides 82-114) 
HTLWKAGILYK (SEQ ID NO:88) encoded by nucleotides CAC ACC CTG TGG AAG 
GCC GGA ATC CTG TAT AAG (SEQ ID NO:89); 

PADRE-universal MHC class II epitope (pMIN.O amino acid residues 32- 
45, nucleotides 115-153) AKFVAAWTLKAAA (SEQ ID NO:38) encoded by nucleotides, 
GCC AAG TTC GTG GCT GCC TGG ACC CTG AAG GCT GCC GCT (SEQ ID 
NO:90); 

HBV core 18-27 (A2 restricted) (pMIN.O amino acid residues 46-55, 
nucleotides 154-183) FLPSDFFPSV (SEQ ID N0:91) encoded by nucleotides TTC CTG 
CCT AGC GAT TTC TTT CCT AGC GTG (SEQ ID NO:92); 

HTV env 120-128 (A2 restricted) (pMIN.O amino acid residues 56-64, 
nucleotides 184-210) KLTPLCVTL (SEQ ID NO:93) encoded by nucleotides AAG CTG 
ACC CCA CTG TGC GTG ACC CTG (SEQ ID NO:94); 
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HBV pol 551-559 (A2 restricted) (pMIN.O amino acid residues 65-73, 
nucleotides 211-237) YMDDVVLGA (SEQ ID NO:95) encoded by nucleotides TAT ATG 
GAT GAC GTG GTG CTG GGA GCC (SEQ ID NO:96); 

mouse ovalbumin 257-264 (K** restricted) (pMIN.O amino acid residues 
74-81, nucleotides 238-261) SUNFEKL (SEQ ID NO:97) encoded by nucleotides AGC 
ATC ATC AAC TTC GAG AAG CTG (SEQ ID NO:98); 

HBV pol 455-463 (A2 restricted) (pMIN.O amino acid residues 82-90, 
nucleotides 262-288) GLSRYVARL (SEQ ID NO:99) encoded by nucleotides GGA CTG 
TCC AGATAC GTG GCT AGG CTG (SEQ ED NO: 100); 

HIV pel 476-84 (A2 restricted) (pMIN.O amino acid residues 91-99, 
nucleotides 289-315) HKEPVHGV (SEQ ID NO: 101) encoded by nucleotides ATC CTG 
AAG GAG CCT GTG CAC GGC GTG (SEQ ID NO:102); 

HBV core 141-151 (All restricted) 

(pMIN.O amino acid residues 100-110, nucleotides 316-348) 
STLPETTWRR (SEQ ID NO: 103) encoded by nucleotides TCC ACC CTG CCA GAG 
ACC ACC GTG GTG AGG AGA (SEQ ID NO: 104); 

HIV env 49-58 (All restricted) (pMIN.O amino acid residues 1 1 1-120, 
nucleotides 349-378) TVYYGVPVWK (SEQ ID NO: 1 05) encoded by nucleotides ACC 
GTG TAG TAT GGA GTG CCT GTG TGG AAG (SEQ ID NO:106); and 

HBV env 335-343 (A2 restricted) (pMIN.O amino acid residues 121-129, 
nucleotides 378-405) WLSLLVPFV (SEQ ID NO:107) encoded by nucleotides TGG 
CTG AGC CTG CTG GTG CCC TTT GTG (SEQ ID NO: 108). 

The pMIN.O vector contains a Kpnl restriction site (pMIN.O nucleotides 
406-411) and a Nhel restriction site (pMIN.O nucleotides 1-6). The pMIN.O vector 
contains a consensus Kozak sequence (nucleotides 7-18) (GCCGCCACCATG; SEQ ID 
NO: 109) and murine Kappa Ig-light chain signal sequence followed by a string of 10 
MHC class I epitopes and one universal MHC class II epitope. The pMIN.O sequence 
encodes an open reading frame fiised to the Myc and His antibody epitope tag coded for 
by the pcDNA 3.1 Myc-His vector. The pMIN.O vector was constructed with eight 
oligonucleotides: 

Mini oligo 

GAGGAGCAGAA.\CAGGCTCTGGATCTGCACCTGCATTCCCATGGTGGCGGCGC 
TAGCAAGCTTCTTGCGC (SEQ ID NO:110); 
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Min2 oligo 

CCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGACACACCCTGTGGA 
AGGCCGGAATCCTGTATA (SEQ ID NO: 111); 
Min3 oligo 

5 TCGCTAGGCAGGAAAGCGGCAGCCTTCAGGGTCCAGGCAGCCACGAACTTGG 
CCTTATACAGGATTCCGG (SEQ E) NO: 112); 
Min4 oligo 

CTTTCCTGCCTAGCGATTTCTTTCCTAGCGTGAAGCTGACCCCACTGTGCGTGA 
CCCTGTATATGGATGAC (SEQ ID NO: 11 3); 

10 Min5 oligo 

CGTACCTGGACAGTCCCAGCTTCTCGAAGTTGATGATGCTGGCT 

CCCAGCACCACGTCATCCATATACAG (SEQ ID N0:1 14); 
Min6 oligo 

GGACTGTCCAGATACGTGGCTAGGCTGATCCTGAAGGAGCCTGTGCACGGCGT 
15 GTCCACCCTGCCAGAGAC(SEQIDN0:115); 
Min7 oligo 

GCTCAGCCACTTCCACACAGGCACTCCATAGTACACGGTCCTCCTCACCACGG 
TGGTCTCTGGCAGGGTG (SEQ ID N0:116); 
Min8 oligo 

20 GTGGAAGTGGCTGAGCCTGCTGGTGCCCTTTGTGGGTACCTGATCTAGAGC 
(SEQ ID NO: 11 7). 

Additional primers were flanking primer 5 GCG CAA GAA GCT TGC 
TAG CG (SEQ ID NO: 1 1 8) and flanking primer 3 GCT CTA GAT CAG GTA CCC 
CAC(SEQIDN0:119). 

25 The original pMIN.O minigene constriiction was carried out using eight 

overlapping oligos averaging approximately 70 nucleotides in length, which were 
synthesized and HPLC purified by Operoii Technologies Inc. Each oligo overlapped its 
neighbor by 15 nucleotides, and the final multi-epitope minigene was assembled by 
extending the overlapping oligos in three sets of reactions using PGR (Ho et al. Gene 

30 77:51-59(1989). 

For the first PGR reaction, 5 ng of each of two oligos were annealed and 
extended: 1+2, 3+4, 5+6, and 7+8 were combined in 100 ^1 reactions containing 0.25 mM 
each dNTP and 2.5 units of Pfii polymerase in Pfii polymerase buffer containing 10 mM 
KCl, 10 mM (NH4)2S04, 20 mM Tris-chloride, pH 8.75. 2 mM MgSO*. 0.1% TRITON 
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X-100 and 100 mg/ml BSA. A Perkin/Elmer 9600 PGR machine was used and the 
annealing temperature used was 5°C below the lowest calculated Tm of each primer pair. 
The full length dimer products were gel-purified, and two reactions containing the 
product of 1-2 and 3-4, and the product of 5-6 and 7-8 were mixed, annealed and 
extended for 10 cycles. Half of the two reactions were then mixed, and 5 cycles of 
annealing and extension carried out before flanking primers were added to amplify the 
full length product for 25 additional cycles. The full length product was gel purified and 
cloned into pCR-blunt (Invitrogen) and individual clones were screened by sequencing. 
The Min insert was isolated as an Nhel-Kpnl fragment and cloned into the same sites of 
pcDNA3 . 1 (-)/Myc-His A (Invitrogen) for expression. The Min protein contains the Myc 
and His antibody epitope tags at its carboxyl-terminal end. 

For all the PCR reactions described, a total of 30 cycles were performed 
using Pfu polymerase and the following conditions: 95*^0 for 15 seconds, annealing 
temperature for 30 seconds, 72°C for one minute. The annealing temperature used was 
S^^C below the lowest calculated Tm of each primer pair. 

Three changes to pMIN.O were made to produce pMIN.l (Figure 21; SEQ 
ID NO:37, also referred to as pMIN-AOS). The mouse ova epitope was removed, the 
position 9 alanine anchor residue (#547) of HB V pol 55 1-560 was converted to a valine 
which increased the in vitro binding affinity 40-fold, and a translational stop codon was 
introduced at the end of the multi-epitope coding sequence. The changes were made by 
amplifying two overlapping firagments and combining them to yield the fiill length 
product. 

The first reaction used the 5' pcDNA vector primer T7 and the primer Min- 
ovaR (nucleotides 247-218) TGGACAGTCCCACTCCCAGCACCACGTCAT (SEQ ID 
NO:120). The 3' half was ampUfied with the primers: Min-ovaF (nucleotides 228-257) 
GCTGGGAGTGGGACTGTCCAGGTACGTGGC (SEQ ID NO: 121) and Min-StopR 
(nucleotides 390-361) GGTACCTCACACAAAGGGCACCAGCAGGC (SEQ ID 
NO:122) 

The two firagments were gel purified, mixed, denatured, annealed, and 
filled in with five cycles of PCR. The fiill length fi-agraent was amplified with the 
flanking primers T7 and Min-Stop for 25 more cycles. The product was gel purified, 
digested with Nhel and Kpnl and cloned into pcDNA3.1 for sequencing and expression. 
The insert fi-om pMin.l was isolated as aa Nhel-Kpnl firagment and cloned into pEP2 to 
makepEP2-A0S. 
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EXAMPLE IT- Assay for T Helper Cell Activation 

This example shows methods for assaying T helper cell activity. One 
method for assaying T helper cell activity uses spleen cells of an immunized organism. 
5 Briefly, a spleen cell pellet is suspended with 2-3 ml of red blood cell lysis buffer 
containing 8.3 g/liter ammonium chloride in 0.001 M Tris^HCl, pH 7.5. The cells are 
incubated in lysis buffer for 3-5 min at rbom temperature with occasional vortexing. An 
excess volume of 50 ml of RI O medium is added.to the cells, and the cells are pelleted. 
The cells are resuspended and pelleted one or two more times in R2 medium or RIO 

— 10- •■ medium.-.- r..- . , 

The cell pellet is suspended in RIO medium and counted. If the cell 

suspension is aggregated, the aggregates are removed by filtration or by allowing the 
aggregates to settk by gravity. The cell concentration is brought to lOVml, and lOOjil of ' 
spleen cells are added to 96 well flat bottom plates. 
15 Dilutions of the appropriate peptide, such as pan DR epitope (SEQ ID 

NO:145), are prepared in RIO medium at 100, 10, 1, 0.1 and 0.01 |tg/ml, and 100 ]il of 
peptide are added to duplicate or triplicate wells of spleen cells. The final peptide 
concentration is 50, 5, 0.5, 0.05 and 0.005 |ig/ml. Control wells receive 100 0 RIO 
medium. 

20 The plates are incubated for 3 days at 37°G. After 3 days, 20^1 of 

50 |iCi/ml ^H-thymidine is added per well. Cells, are incubated for 1 8-24 hours and then 
harvested oiitd glass fiber filters. The incorporation of %-thymidine into DNA of 
: proliferating cells is measured in a beta counter. 

A second assay for T helper cell activity uses peripheral bb^ 
25. mononuclear cells ^BMC) that are stimulated in vitro as described in Alexander aL, 
. supra and Sette (WO 95/07,707), as adapted fi-pm Manca et aL. J. Immunol. 146:1964- 
1971 (1 991), which is incorporated herein by reference. Briefly, PBMC are collected 
from healthy donors and purified over Ficoll-Plaque (Pharmacia Biotech; Piscataway, 
NJ). PBMC are plated in a 24 well tissue culture plate at 4 x 1 0* cells/ml. Peptides are 
30 added at a final concentration of 10 |ig/ml. Cultures are incubated at 37°C in.5% 002. 

On day 4, recombinant interieukin-2 (IL-2) is added at a final 
concentration of 10 ng/ml. Cultures are fed every 3 days by aspirating 1 ml of medium 
and replacing with fi-esh medium containing IL-2. Two additional stimulations of the T 
cells with antigen are performed on approximately days 14 and 28. The T cells (3 x 
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lOVwell) are stimulated with peptide (10 fig/ml) using autologous PBMC cells (2 x lO'' 
irradiated cells/well) (irradiated with 7500 rads) as antigen-presenting cells in a total of 
three wells of a 24 well tissue culture plate. In addition, on day 14 and 28, T cell 
proliferative responses are determined under the following conditions: 2 x 1 0* T 

5 cells/well; 1x10^ irradiated PBMC/well as antigen-presenting cells; peptide 

concentration varying between 0.0 1 and 1 0 fig/ml final concentration. The proliferation 
of the T cells is measured 3 days later by the addition of ^H-thymidine (1 |iCi/well) 1 8 hr 
prior to harvesting the cells. Cells are harvested onto glass filters and ^H-thymidine 
incorporation is measured in a beta plate counter. These results demonstrate methods for 

10 assaying T helper cell activity by measuring ^H-thymidine incprppratipn, 

EXAMPLE ITT: Assav for Cytotoxic T I .vmphncvte Response 

This example shows a method for assaying cytotoxic T lymphocyte (CTL) 
activity. A CTL response is measured essentially as described previously (Vitiello et al, 
15 Eur. J. Immunol (1997), which is incorporated herein by reference). Briefly, 

after approximately 10-35 days following DN A immunization, splenocytes from an 
animal are isolated and co-cultured at 37°C with syngeneic, irradiated (3000 rad) peptide- 
coated LPS blasts (I x lO** to 1.5 x 10* cells/ml) in 10 ml RIO in T25 flasks. LPS blasts 
are obtained by activating splenocytes (1 x 10* to 1.5 x 10* cells/ml) with 25 ng/ml 
20 lipopolysaccharides (LPS) (Sigma cat. no. L-2387; St. Louis, MO) and 7 ng/ml dextran 
sulfate (Pharmacia Biotech) in 30 ml RIO medimn in T75 flasks for 3 days at 37°C. The 
lymphoblasts are then resuspended at a concentration of 2.5 x 10^ to 3.0 x lO'/ml, 
irradiated (3000 rad), and coated with the appropriate peptides (lOOng/ml) for 1 h at 
37?C. Cells are washed once, resuspended in RIO medium at the desired concentration 
25 and added to the responder cell preparation. Cultures are assayed for cytolytic activity on 
day 7 in a *'Cr-release assay. 

For the ^'Cr-release assay, target cells are labeled for 90 min at 37°C with 
1 50 nl sodium ^'chromate C'Cr) (New England Nuclear; Wihnington DE), washed three 
times and resuspended at the appropriate concentration in RIO medium. For the assay, 
30. 10" target cells are incubated in the presence of different concentrations of effector cells 
in a final volume of 200 \i\ in U-bottom 96 well plates in the presence or absence of 1 0 
Hg/ml peptide. Supematants are removed after 6 h at 37°C, and the percent specific lysis 
is determined by the formula: percent specific lysis = 100 x (experimental release - 
spontaneous release), (.maximum release - spontaneous release). To facilitate comparison 
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of responses from different . experiments, the percent release data is transformed to lytic 
units 30 per 10^ cells (LU30/10^), with 1 LU30 defined as the number of effector cells 
required to induce 30% lysis of 10"^ target cells in a 6 h assay. LU values represent the 
LU30/iO^ obtained in the presence of peptide minus LU30/10^ in the absence of peptide. 
These results demonstrate methods for assaying CTL activity by measuring ^^Cr release 
from cells. 

EXAMPLE IV: T Cell Proliferation in Mice Inmiuni zed with Expression Vectors 
Encoding MHC Class II Epitopes and MHC Class II Targeting Sequences 

This example demonstrates that expression vectors encoding MHC class II 
epitopes and MHC class II targeting sequences are effective at activating T cells. 

The constructs used in the T cell proliferation assay are described in 
Example I and were cloned into the vector pEP2, a CMV driven expression vector. The 
peptides used for T cell in vitro stimulation are: Ova 323-339, ISQAVHAAHAEINEAGR 
(SEQ ID NO:123); HBVcorel28, TPPAYRPPNAPILF (SEQ ID NO:124); HBVenvl82, 
FFLLTRILTIPQSLD (SEQ ID NO: 125); and PADRE, AKFVAAWTLKAAA (SEQ ID 
NO:38). 

T cell proliferation was assayed essentially as described in Example II. 
Briefly, 12 to 16 week old B6D2 Fl mice (2 mice per construct) were injected with 100 

of the indicated expression vector (50 |ig per leg) in the anterior tibialis muscle. After 
eleven days, spleens were collected from the mice and separated into a single cell 
suspension by Dounce homogenization. The splenocytes were counted and one million 
splenocytes were plated per well in a 96-well plate. Each sample was done in triplicate. 
Ten jig/ml of the corresponding peptide encoded by the respective expression vectors was 
added to each well. One well contained splenocytes without peptide added for a negative 
control. Cells were cultured at 37^*0, 5% CO2 for three days. 

After three days, one jiCi of ^H-thymidine was added to each well After 
18 hours at 37^C, the cells were harvested onto glass filters and incorporation was 
measured on an LKB P plate counter. The results of the T cell proliferation assay are 
shown in Table 9. Antigenspecific T cell proliferation is presented as the stimulation 
index (SI); this is defined as the ratio of the average ^H-thymidine incorporation in the 
presence of antigen divided by the ^H-thymidine incorporation in the absence of antigen. 

The immunogen "PADRE + IFA" is a positive control where the PADRE 
peptide in incomplete Freund's adjuvant was injected into the mice and compared to the 
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response seen by injecting the MHC class 11 epitope constructs containing a PADRE 
sequence. As shown in Table 9, most of the expression vectors tested were effective at 
activating T cell proliferation in response to the addition of PADRE peptide. The activity 
of several of the expression vectors was comparable to that seen with immunization with 
the PADRE peptide in incomplete Freund's adjuvant. The expression vectors containing 
both MHC class I and MHC class U epitopes, pEP2-A0S and pcDNA-AOS, were also 
efifective at activating T cell proliferation in response to the addition of PADRE peptide. 

These results show that expression vectors encoding MHC class II 
epitopes fused to a MHC class II targeting sequence is effective at activating T cell 
proUferation and are useful for stimulating an immune response. 

EXAMPLE V: In vivo assay Using Transgenic Mice 
A. Materials and methods 

Peptides were synthesized according to standard F-moc solid phase 
synthesis methods w hich have been previously described (Ruppert et ai, Cell 74:929 
(1993); Settee/ a/., MoL Immunol. 31:813 (1994)). Peptide purity was determined by 
analytical reverse-phase HPLC and purity was routinely >95%. Synthesis and 
purification of the Theradigm-HBV lipopeptide vaccine is described in (Vitiello et ai, J. 
C/m./nve5r. 95:341 (1995)). 

Mice 

HLA-A2.1 transgenic mice used in this study were the Fl generation 
derived by crossing transgenic mice expressing a chimeric gene consisting of the al, a2 
domains of HLA-A2. 1 and a3 domain of H-2K^ with SJL/J mice (Jackson Laboratory, 
Bar Harbor, ME). This strain will be referred to hereafter as HLA-A2.1/K*'-H-2^'''. The 
parental HLA-A2.L'K^ transgenic strain was generated on a C57BL/6 background using 
the transgene and methods described in (Vitiello et al, J. Exp, Med. 173:1007 (1991)). 
HLA-Al 1/K^ transgenic mice used in the current study were identical to those described 
in (Alexander et ai, J, Immunol 159:4753 (1997)). 

Cell lines. MHC purification, and p eptide bindine assay 
Target cells for peptide-specific cytotoxicity assays were Jurkat cells 
transfected with the HLA-A2.1/K^ chimeric gene (Vitiello era/., J. Exp. Med. 173:1007 
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(1991)) and .221 tumor cells transfected with HLA-Al I/K" (Alexander et al. J. Immunol. 
159:4753 (1997)). 

To measure presentation of endogenously processed epitopes, Jurkat- 
AZ-l/K** cells were transfected with the pMin.l or pMin.2-GFP minigenes then tested in a 
cytotoxicity assay against epitope-specific CTL lines. For transfection, Jurkat-A2.1/K'' 
cells were resuspended at lO' cells/ml and 30 [ig of DNA was added to 600 ^1 of cell 
suspension. After electroporating .cells in a 0.4 cm cuvette at 0.25 kV, 960 jiFd, cells 
were incubated on ice for 10 min then cultured for 2 d in RPMI culture medium. Cells 
were then cultured in medium containing 200 U/ml hygromycin B (Calbiochem, San 
Diego CA) to select for stable transfectants. FACS was used to enrich the fraction of 
green fluorescent protein (GFP)-expressing cells from 15% to 60% (data not shown). 

Methods for measuring the quantitative binding of peptides to purified 
HLA-A2.1 and -Al 1 molecules is described m Ruppert et al., Cell 74:929 (1993); Sette et 
al., Mol. Immunol. 31:813 (1994); Alexander et al, J. Immunol. 159:4753 (1997). 

All tumor cell lines and splenic CTLs from primed mice were grown in 
culture medium (CM) that consisted of RPMI 1640 medium with Hepes (Life 
Technologies, Grand Island, NY) supplemented with 10% FBS, 4 mM L-glutamine, 5 X 
10"^ M 2-ME, 0.5 mM sodium pyruvate, 100 ng/ml streptomycin, and 100 U/ml 
penicillin. 

Construction of minigene multi-e pitope DNA plasmids 
pMIN.O and pMIN.l (i.e., pMIN-AOS) were constructed as described 
above and in USSN 60/085,751. 

pMin.l-No PADRE and oMin l -Anchor. pMin.1 was amplified using two 
overiqjping fragments which was then combined to yield the full length product. The 
first reaction used the 5 ' pcDNA vector primer 17 and either primer 
ATCGCTAGGCAGGAACTTATACAGGATTCC (SEQ ID NO:126) for pMin.l-No 
PADRE or TGGACAGTCCGGCTCCCAGCACCACGT (SEQ ID NO: 127) for pMin.l- 
Anchor. The 3' half was amplified with the primers TTCCTGCCTAGCGATTTC (SEQ 
ID NO: 128) (No PADRE) or GCTGGGAGCCGGACTGTCCAGGTACGT (SEQ ID 
NO:129) (Anchor) and Min-StopR. The two fragments generated from amplifying the 5 ' 
and 3' ends were gel purified, mixed, denatured, annealed, and filled in with five cycles 
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of PCR. The full length fragment was funner amplified with the flanking primers T7 and 
Min-StopR for 25 more cycles. 

pMin.l-No Sis. The Ig signal sequence was deleted from pMin.l by PCR 
amplification with primer GCTAGCGCCGCCACCATGCACACCCTGTGGAAGGC 
CGGAATC (SEQ ID NO: 130) and pcDNA rev (Invitrogen) primers. The product was 
cloned into pCR-blunt and sequenced. 

pMin.l -Switch. Three overlapping fragments were amplified from 
pMin. 1 , combined, and extended. The 5 ' fragment was amplified with the vector primer 
T7 and primer GGGC.ACCAGCAGGCTCAGCCACACTCCCAGCACCACGTC (SEQ 
ID N0:13 1). The second overlapping fragment was amplified with primers 
AGCCTGCTGGTGCCCTTTGTGATCCTGAAGGAGCCTGTGC (SEQ ID NO:132) 
and AGCCACGTACCTGGACAGTCCCTTCCACACAGGCACTCGAT (SEQ ID 
NO: 133). Primer TGTCCAGGTACGTGGCTAGGCTGTGAGGTACC (SEQ ID 
NO: 1 34) and the vector primer pcDNA rev (Invitrogen) were used to amplify the third 
(3') fragment. Fragments 1, 2, and 3 were amplified and gel purified. Fragments 2 and 3 
were mixed, annealed, amplified, and gel purified. Fragment 1 was combined with the 
product of 2 and 3, and extended, gel purified and cloned into pcDNA3.1 for expression. 

pMin.2-GFP. The signal sequence was deleted from pMin.O by PCR 
amplification with Min.O-No Sig-5' plus pcDNA rev (Invitrogen) primers 
GCTAGCGCCGCCACCATGCACACCCTGTGGAAGGCCGGAATC (SEQ ID 
N0:135). The product was cloned into pCR-blunt and sequenced. The insert containing 
the open reading frame of the signal sequence-deleted multi-epitope construct was cut out 
with Nhel plus Hindni and ligated into the same sites of pEGFPNl (Clontech). This 
construct fiises the coding region of the signal-deleted pMin.O construct to the N-terminus 
of green fluorescent protein (GFP). 

Immunization of mice 

For DNA immunization, mice were pretreated by injecting 50 (il of 10 nM 
cardiotoxin (Sigma Chem. Co., #C9759) bilaterally into the tibialis anterior muscle. Four 
or five days later, 100 ng of DNA diluted in PBS were injected m the same muscle. 
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Theradigm-HBV lipopeptide (10 mg/ml in DMSO) that was stored at - 
was thawed for 10 min at 45°C before being diluted 1:10 (v/v) with room 
temperature PBS. Immediately upon addition of PBS, the lipopeptide suspension was 
vortexed vigorously and 100 \il was injected s.c. at the tail base (100 ^g/mouse). 
5 Immunogenicity of individual CTL epitopes was tested by mixing each 

CTL epitope (50 ^ig/mouse) with the HBV core 128-140 peptide (TPPAYRPPNAPIL 
(SEQ ID NO:124), 140 jig/mouse) which served to induce I-A^'-restricted Th cells. The 
peptide cocktail was then emuslifed in incomplete Freund's adjuvant (Sigma Chem. Co.) 
and 100 \i\ of peptide emulsion was injected sx. at the tail base. 

10 

In vitro CTL cultures and cytotoxicity assays 

Eleven to 14 days after immunization, animals were sacrificed and a single 
cell suspension of splenocytes prepared. Splenocytes from cDNA-primed animals were 
stimulated in vitro with each of the peptide epitopes represented in the minigene. 

15 Splenocytes (2.5-3.0 X lOVflask) were cultured in upright 25 cm^ flasks in the presence 
of 10 |ig/ml peptide and lO'' irradiated spleen cells that had been activated for 3 days with 
LPS (25 |ig/ml) and dextran sulfate (7 jig/ml). Triplicate cultures were stimulated with 
each epitope. Five days later, cultures were fed with fresh CM. After 10 d of m vitro 
culture, 2-4 X 10^ CTLs from each flask were restimulated with 10^ LPS/dextran sulfate- 

20 activated splenocytes treated with 100 ng/ml peptide for 60-75 min at 37°C, then 

irradiated 3500 rads, CTLs were restimulated in 6-well plates in 8 ml of cytokine-free 
CM. Eighteen hr later, cultures received cytokines contained in con A-activated 
splenocyte supernatant (10-15% final concentration, v/v) and were fed or expanded on the 
third day with CM containing 10-15% cytokme supemate. Five days after restimulation, 

25 CTL activity of each culture was measured by incubating varying numbers of CTLs with 
1 0"^ ^*Cr-labelled target cells m the presence or absence of peptide. To decrease 
nonspecific cytotoxicity from NK cells, YAC-1 cells (ATCC) were also added at a YAC- 
l:^^Cr4abeled target cell ratio of 20:1. CTL activity against the HBV Pol 551 epitope 
was measured by stimulating DNA-primed splenocytes in vitro with the native A- 

30 containing peptide and testmg for cytotoxic activity against the same peptide. 

To more readily compare responses, the standard E:T ratio vs % 
cytotoxicity data cun^es were converted into LU per 10^ effector cells with one LU 
defined as the lytic activity required to achieve 30% lysis of target cells at a 100:1 E:T 
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ratio. Specific CTL activity (ALU) was calculated by subtracting the LU value obtained 
in the absence of peptide from the LU value obtained with peptide. A given culture was 
scored positive for CTL induction if all of the following criteria were met: 1) ALU >2; 2) 
LU(+ peptide) ^ LU(- peptide) > 3- and 3) a >10% difference in % cytotoxicity tested 
with and without peptide at the two highest E:T ratios (starting E:T ratios were routinely 
between 25-50:1). 

CTL lines were generated from pMin. 1 -primed splenocytes through 
repeated weekly stimulations of CTLs with peptide-treated LPS/DxS-activated 
splenocytes using the 6-well culture conditions described above with the exception that 
CTLs were expanded in cytokine-containing CM as necessary during the seven day 
stimulation period. 

Cvtokine assay 

To measure IFN-y production in response to minigene-transfected target 
cells, 4X10^ CTLs were cultured with an equivalent number of minigene-transfected 
Jurkat-A2.1/K^ cells in 96-well flat bottom plates. After overnight incubation at 37^C, 
culture supernatant from each well was collected and assayed for IFN-y concentration 
using a sandwich ELISA. Immulon 11 microtiter wells (Dynatech, Boston, MA) were 
coated ovemight at 4°C with 0.2 |ig of anti-mouse IFN-y capture Ab, R4-6A2 
(Pharmingen). After washing wells with PBS/0.1% Tween-20 and blocking with 1% 
BSA, Ab-coated wells were incubated with culture supemate samples for 2 hr at room 
temperature. A secondary anti-IFN-y Ab, XMG1.2 (Pharmingen), was added to wells and 
allowed to incubate for 2 hr at room temperature. Wells were then developed by 
incubations with Avidin-DH and finally with biotinylated horseradish peroxidase H 
(Vectastain ABC kit, Vector Labs, Burlingame, CA) and TMB peroxidase substrate 
(Kiricegaard and Perry Labs, Gaithersberg, MD). The amount of cytokine present in each 
sample was calculated using a rlFN-y standard (Pharmingen). 

6. Results 

Selection of epitopes and minigene const ruct desien 
In the first series of experiments, the issue was whether a balanced 
multispecific CTL response could be induced by simple minigene cDNA constructs that 
encode several dominant HLA class I-restricted epitopes. Accordingly, nine CTL 
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epitopes were chosen on the basis of their relevance in CTL immunity during HB V and 
HIV infection in humans, their sequence conservancy among viral subtypes, and their 
class I MHC binding affinity (Table 10). Of these nine epitopes, six are restricted by 
HLA-A2.1 and three showed HLA-All -restriction. One epitope, HBV Pol 551, was 
studied in two alternative forms: either the wild type sequence or an analog (HBV Pol 
551-V) engineered for higher binding affinity. 

As referenced in Table 10, several independent laboratories have reported 
that these epitopes are part of the dominant CTL response during HBV or HIV infection. 
All of the epitopes considered showed greater than 75% conservancy in primary amino 
acid sequence among the different HBV subtypes and HIV clades. The MHC binding 
affinity of the peptides was also considered in selection of the epitopes. These 
expenment addressed the feasibility of immunizing with epitopes possessing a wide range 
of affinities and, as shown in Table 10, the six HBV and three HIV HLA-restricted 
epitopes covered a spectrum of MHC binding affinities spanning over two orders of 
magnitude, with IC5o% concentrations ranging from 3 nM to 200 nM. 

The iramunogenicity of the six A2.1- and three All-restricted CTL 
epitopes .in transgenic mice was verified by co-immunization with a helper T cell peptide 
in an IFA formulation. All of the epitopes induced significant CTL responses in the 5 to 
73 ALU range (Table 10). As mentioned above, to improve the MHC binding and 
immunogenicity of HBV Pol 551, the C-terminal A residue of this epitope was substituted 
with V resulting in a dramatic 40-fold increase in bmding affinity to HLA-A2.1 (Table 
10). While the parental sequence was weakly or noninmiunogenic in HLA transgenic 
mice, the HBV Pol 551-V analog induced significant levels of CTL activity when 
administered in IFA (Table 10). On the basis of these results, the V analog of the HBV 
Pol 551 epitope was selected for the initial minigene construct. In all of the experiments 
reported herein, CTL responses were measured with target cells coated with the native 
HBV Pol 55 1 epitope, irrespective of whether the V analog or native epitope was utilized 
for immunization. 

Finally, since previous studies indicated that induction of T cell help 
significantly improved the magnitude and duration of CTL responses (Vitiello et ai, J. 
Clin, Invest. 95:341 (1995); Livingston er a/., Immunol 159:1383 (1997)), the universal 
Th cell epitope PADRE was also incorporated into the minigene. PADRE has been 
shown previously to have high MHC binding affinity to a wide range of mouse and 
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human MHC class II haplotypes (Alexander et al. Immunity 1 :751 (1994)). In particular, 
it has been previously shown that PADRE is highly immunogenic in mice that are 
used in the current study (Alexander et al, Immunity 1 :751 (1994)). 

pMin.l, the prototype cDNA minigene construct encoding nine CTL 

5 epitopes and PADRE, was synthesized and subcloned into the pcDNA3.1 vector. The 
position of each of the nine epitopes in the minigene was optimized to avoid junctional 
mouse H-2*' and HLA-A2.1 class I MHC epitopes. The mouse Ig k signal sequence was 
also included at the 5* end of the construct to facilitate processing of the CTL epitopes in 
the endoplasmic reticulum (ER) as reported by others (Anderson et al, J, Exp. Med, 

1 0 1 74:489 (1 99 1 )), To avoid further conformational structure in the translated polypeptide 
gene product that may affect processing of the CTL epitopes, an ATG stop codon was 
introduced at the 3' end of the minigene construct upstream of the coding region for c- 
myc and poly-his epitopes in the pcDNAS.l vector. 

15 Immunogenicitv of pMin.l in transgenic mice 

To assess the capacity of the pMin.l minigene construct to induce CTLs in 
vivo, HLA-A2.1/K^-H-2^''' transgenic mice were immunized intramuscularly with 100 jig 
•of naked cDNA. As a means of comparing the level of CTLs induced by cDNA 
immunization, a control group of animals was also immunized with Theradigm-HB V, a 

20 palmitolyated lipopeptide consisting of the HBV Core 1 8 CTL epitope linked to the 
tetanus toxin 830-843 Th cell epitope. 

Splenocytes from immunized animals were stimulated twice with each of 
the peptide epitopes encoded in the minigene, then assayed for peptide-specific cytotoxic 
activity in a ^^Cr release assay. A representative panel of CTL responses of pMin. l- 

25 primed splenocytes, shown in Figure 22, clearly indicates that significant levels of CTL 
induction were generated by minigene immunization. The majority of the cultures 
stimulated with the different epitopes exceeded 50% specific lysis of target cells at an E:T 
ratio of 1 : 1 . The results of four independent experiments, compiled in Table 1 1 , indicate 
that the pMin,l construct is indeed highly immunogenic in HLA-A2.1/K^-H-2^''^ 

30 transgenic mice, inducing a broad CTL response directed against each of its six A2.1- 
restricted epitopes. 

To more conveniently compare levels of CTL induction among the 
different epitopes, the % cytotoxicity values for each splenocyte culture was converted to 
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ALU and the mean ALU of CTL activity in positive cultures for each epitope was 
detennined (see Example V, materials and methods, for positive criteria). The data, 
expressed in this manner in Table 1 1, confirms the breadth of CTL induction elicited by 
pMin.l immunization since extremely high CTL responses, ranging between 50 to 700 

5 ALU, were observed against the six A2.1-restricted epitopes. More significantly, the 
responses of several hundred ALU observed for five of the six epitopes approached or 
exceeded that of the Theradigm-HBV lipopeptide, a vaccine formulation known for its 
high CTL-inducing potency (Vitiello et al, Clin. Invest 95:341 (1995); Livingston et 
aL 1 Immunol 159:1353 (1997)). The HBV Env 335 epitope was the only epitope 

1 0 showing a lower mean ALU response compared to lipopeptide (Table 1 1 , 44 vs 349 
ALU). 

Processing of minieene epitopes bv transfect ed cells 

The decreased CTL response observed against HBV Env 335 was 

1 5 somewhat unexpected since this epitope had good A2. 1 binding affinity (IC50%, 5 nM) 
and was also immunogenic when administered in Er A. The lower response may be due, 
at least in part, to the inefficient processing of this epitope firom the minigene polypeptide 
by antigen presenting cells following in vivo cDNA immunization. To address this 
possibility, Jurkat-A2.1.K^ tumor cells were transfected with pMin.l cDNA and the 

20 presentation of the HB\' Env 335 epitope by transfected cells was compared to more 
immunogenic. A2.1 -restricted epitopes using specific CTL lines. Epitope presentation 
was also studied using tumor cells transfected with a control cDNA construct, pMin.2- 
GFP, that encoded a similar multi-epitope minigene fused with GFP which allows 
detection of minigene expression in transfected cells by FACS. 

25 Epitope presentation of the transfected Jurkat cells was analyzed using 

specific CTL lines, with cytotoxicity or IFN-y production serving as a read-out. It was 
found that the levels of CTL response correlated directly with the in vivo immunogenicity 
of the epitopes. Highly immunogenic epitopes in vivo, such as HBV Core 18, HIV Pol 
476, and HBV Pol 455, were efficiently presented to CTL lines by pMin, 1- or pMin.2- 

30 GFP-transfected cells as measured by TPN-y production (Figure 23 A, >1 00 pg/ml for each 
epitope) or cytotoxic activity (Figure 23C, >30% specific lysis). In contrast to these high 
levels of m vitro activit\', the stimulation of the HBV Env 335-specific CTL line against 
both populations of transfected cells resulted in less than 12 pg/ml IFN-y and 3% specific 
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lysis. Although the HB V Env 335-specific CTL line did not recognize the naturally 
processed epitope efficiently, this line did show an equivalent response to peptide-loaded 
target cells, as compared to CTL lines specific for the other epitopes (Figure 23B, D). 
Collectively, these results suggest that a processing and/pr presentation defect associated 
5 with the HBV Env 335 epitope that may contribute to its diminished immunogencity in 
vivo. 



Effect of the helper T cell epitope PA DRE on miniaene immunogenicity 
Having obtained a broad and balanced CTL response in transgenic mice 

1.0 immunized with a minigene cDNA encoding multiple HLA-A2. 1 -restricted epitopes, next 
possible variables were examined that could influence the immunogenicity of the 
prototype construct. This type of analysis could lead to rational and rapid optimization of 
future constructs. More specifically, a cDNA construct based on the pMin.l prototype 
was synthesized in which the PADRE epitope was deleted to examine the contribution of 

15 T cell help in minigene immunogenicity (Figure 24 A). 

The results of the immunogenicity analysis indicated that deletion of the 
PADRE Th cell epitope resulted in significant decreases in the frequency of specific CTL 
precursors against four of the minigene epitopes (HBV Core 18, HIV Env 120, HBV Pol 
455, and HBV Env 335) as indicated by the 17 to 50% CTL-positive cultures observed 

20 against these epitopes compared to the 90- 1 00% frequency in animals immunized with 
the prototype pMin.l construct (Figure 25). Moreover, for two of the epitopes, HBV 
Core 18 and HIV Env 120, the magnitude of response in positive cultures induced by 
pMin.l-No PADRE was 20- to 30-fold less than that of the pMin.l construct (Figure 
25A). 

25 

Effect of modulation of MHC binding affi nity on epitope immunogenicity 
Next a construct was synthesized in which the V anchor residue in HBV 
Pol 551 was replaced with alanine, the native residue, to address the effect of decreasing 
MHC binding on epitope inununogenicity (Figure 24B). 
30 Unlike deletion of the Th cell epitope, decreasing the MHC binding 

capacity of the HBV Pol 551 epitope by 40-fold through modification of the anchor 
residue did not appear to affect epitope immunogenicity (Figure 25B). The CTL response 
against the HBV Pol 55 1 epitope, as well as to the other epitopes, measured either by LU 
or frequency of CTL-positive cultures, was very similar between the constructs 
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containing the native A or improved V residue at the MHC binding anchor site. This 
finding reinforces the notion that minimal epitope minigenes can efficiently deliver 
epitopes of vastly different MHC binding affinities. Furthermore, this finding is 
particularly relevant to enhancing epitope immunogenicity via different delivery methods, 
5 especially in light of the fact that the wild type HBV Pol 55 1 epitope was essentially 
nonimmunogenic when delivered in a less potent BFA emulsion. 

Effect of the signal sequence on minieene construct immunogenicitv 
The signal sequence was deleted fi:om the pMin.l construct, thereby 
10 preventing processing oftheminigene polypeptide in the ER (Figure 24C). When the 
immunogenicity of the pMin.l-No Sig constmct was examined, an overall decrease in 
response was found against four CTL epitopes. Two of these epitopes, HIV Env 120 and 
HBV Env 335, showed a decrease in frequency of CTL-positive cultures compared to 
pMin.l while the remaining epitopes, HBV Pol 455 and HIV Pol 476, showed a 16-fold 
15 (firom 424 to 27 ALU) and 3-fold decrease (709 to 236 ALU) in magnitude of the mean 
. CTL response, respectively (Figure 25C). These findings suggest that allowing ER- 
processing of some of the epitopes encoded in the pMin.l prototype construct may 
improve immunogenicity, as compared with constructs that allow only cytoplasmic 
processing of the same panel of epitopes. 

20 

Effect of epitope rearrangement and creation of new junctional epitopes 
In the final construct tested, the immunogenicity of the HB V Env 335 
epitope was analyzed to determine whether it may be influenced by its position at the 3' 
terminus of the minigene construct (Figure 24D). Thus, the position of the Env epitope in 

25 the cDNA construct was switched with a more immunogenic epitope, HBV Pol 455, 
located in the center of the minigene. It should be noted that this modification also 
created two potentially new epitopes. As shown in Figure 25D, the transposition of the 
two epitopes appeared to affect the immunogenicity of not only the transposed epitopes 
but also more globally of other epitopes. Switching epitopes resulted in obliteration of 

30 CTL induction against HBV Env 335 (no positive cultures detected out of six). The CTL 
response induced by the terminal HBV Pol 455 epitope- was also decreased but only 
slightly (424 vs 78 mean ALU). In addition to the switched epitopes, CTL induction 
. against other epitopes in the pMin.l -Switch constmct was also markedly reduced 
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compared to the protot\T5e construct. For example, a CTL response was not observed 
against the fflV Env 120 epitope and it was significantly diminished against the HBV 
Core 18 (4 of 6 positive cultures, decrease in mean ALU from 306 to 52) and HBV Pol 
476 (decrease in mean ALU from 709 to 20) epitopes (Figure 25D). 

As previously mentioned, it should be noted that switching the two 
epitopes had created new junctional epitopes. Indeed, in the pMin.l -Switch construct, 
two new potential CTL epitopes were created from sequences of HBV Env 335-HIV Pol 
476 (LLVPFVIL (SEQ ID NO:135), H-2K^-reslricted) and HBV Env 335-HBV Pol 551 
(VLGVWLSLLV (SEQ ED NO:136), HLA-A2.1 -restricted) epitopes. Although these 
junctional epitopes have not been examined to determine whether or not they are indeed 
immunogenic, this may account for the low immunogenicity of the HBV Env 335 and 
HIV Pol 476 epitopes. These findings suggest that avoiding junctional epitopes may be 
important in designing multi-epitope minigenes as is the ability to confirm their 
immunogenicity in vivo in a biological assay system such as HLA transgenic mice. 

- Induction of CTLs against Al 1. epitopes encod ed in pMin.l 

To further examine the flexibility of the minigene vaccine approach for 
inducing a broad CTL response against not only multiple epitopes but also against 
epitopes restricted by different HLA alleles, HLA-Al 1/K^ transgenic mice were 
immunized to determine whether the three Al 1 epitopes in the pMin.l construct were 
immunogenic for CTLs, as was the case for the A2.1-restricted epitopes in the same 
construct. As summarized in Table 12, significant CTL induction was observed in a 
majority of cultures against all three of the HLA-A 11 -restricted epitopes and the level of 
CTL immunity induced for the three epitopes, in the range of 40 to 260 ALU, exceeded 
that of peptides deUvered in IFA (Table 10). Thus, nine CTL epitopes of varying HLA 
restrictions incorporated into a prototype minigene construct all demonstrated significant 
CTL induction in vivo, confirming that minigene DNA plasmids can sen^e as means of 
delivering multiple epitopes, of varying HLA restrictions and MHC binding affinities, to 
the immune system in an immunogenic fashion and that appropriate transgenic mouse 
strains can be used to measure DNA construct inmiunogenicity in vivo; 

CTLs were also induced against three Al 1 epitopes in Al 1/K*' transgenic 
mice. These responses suggest that minigene delivery of multiple CTL epitopes that 
confers broad population coverage may be possible in humans and that transgenic animals 
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of appropriate haplotypes may be a useful tools in optimizing the in vivo immunogenicity 
of minigene DNA. In addition, animals such as monkeys having conserved HLA 
molecules with cross reactivity to CTL and HTL epitopes recognized by human MHC 
molecules can be used to determine human immunogenicity of HTL and CTL epitopes 

5 (Bertoni et aL 1 /m77iM«o/.161 :4447-4455 (1998)). 

This study represents the furst description of the use of HLA transgenic 
mice to quantitate the in vivo immunogenicity of DNA vaccines, by examining response 
to epitopes restricted by human HLA antigens. In vivo studies are required to address the 
variables crucial for vaccine development, that are not easily evaluated by in vitro assays, 

10 such as route of administration, vaccine formulation, tissue biodistribution, and 

involvement of priman.' and secondary lymphoid organs. Because of its simplicity and 
flexibility, HLA transgenic mice represent an attractive alternative, at least for initial 
vaccine development smdies, compared to more cumbersome and expensive studies in 
higher animal species, such as nonhuman primates. The in vitro presentation studies 

15 described aboye further supports the use of HLA transgenic mice for screening DNA 
constructs containing human epitopes inasmuch as a direct correlation between in vivo 
immunogenicity and in vitro presentation was observed. Finally, strong CTL responses 
were observed against all six A 2.1 restricted viral epitopes and in three Al 1 restricted 
epitopes encoded in the prototype pMin.l construct. For five of the A 2.1 restricted 

20 epitopes, the magnitude of CTL response approximated that observed with the 

lipopeptide, Theradigm-HBV, that previously was shown to induce strong CTL responses 
in humans (Vitiello et al, 7. Clin, Invest, 95:341 (1995); Livingston et al, 1 Immunol 
159:1383 (1997)). 
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Table 9. Activation of T Cell Proliferation by Expression 
Vectors Encoding MHC Class II Epitopes Fused to MHC 
Class II Targeting Sequences 

5 

Immunogen Stimulating Peptide ^ 

PADRE OVA 323 CORE 128 



15 



peptide - CFA^ 


3.0(1.1) 


2.7 (1.2) 


3 .2 (1.4) 


pEP2.(PA0S).(-) 








pEP2.(A0S).(-) 


5.6(1.8) 






pEP2.(PA0S).(sigTh) 


5.0 (2.9) 




2.6(1.5) 


pEP2.(PA0S).(IgaTh) 


5.6 (2.1) 




3.0(1.6) 


pEP2.(PA0S).(LampTh) 


3.8(1.7) 




3 


pEP2.(PA0S).(IiTh) 


5.2 (2.0) 


3.2(1.5) 


3.7(1.5) 


pEP2.(PAOS).(H2M) 


3.3(1.3) 




2.8 



' Geometric mean of cultures with Sr> 2. 
^Proliferative response measured in the lymph node. 

20 
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Table 10 

CTL Epitopes in cDN A Minigene 

Immunogenicity In Vivo (IFA) 



Epitope 


Sequence 


MHC 
Restrict. 


MHC 
Binding 
Affinity 


KT_ /*~TT 

No. U 1 L- 
Positive 
Cultures 


iL, i\.esponse 
(Geo. Mean 
x/fSD) ^ 








(IC30% (nM) 




ALU 


HBV Core 18 


FLPSDFFPSV 


A2.1 


3 


6/6 


73.0(1.1) 


HBV Env335 


WLSLLVPFV 


A2.1 


5 


4/6 


5.3(1.6) 


HBV Pol 455 


GLSRYVARL 


A2.1 


76 


ND = 


ND 


HIV Envl20 


KLTPLCVTL 


A2.1 


102 


2/5 


6.4(1.3) 


HIV Pol 476 


ILKEPVHGV 


A2.1. 


192 


2/5 


15.2 (2.9) 


HBV Pol 55I-A 


YMDDWLGA 


All 


200 


0/6 




HBV Pol 551-V 


YMDDWLGV 


A2.1 


5 


6/6 


8.2 (2.3) 


HIV Env 49 


TVY\'GV?VWK 


All 


4 


28/33 


13.4 (3,1) 


HBV Core 141 


STLPETTWRR 


All 


4 


6/6 


12.1 (2.6) 


HBV Pol 149 


HTLWKAGILYK 


All 


14 


6/6 


13.1 (1.2) 



a Peptide tested in HLA-A2. l/K** H^2 ^ transgenic mice by co-immunizing with a T helper cell peptide in IFA. 
b Geometric mean CTL response of positive cultures, 
c ND, not done. 
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Table 11 

Summary of Immunogenicity ofpMin.l DNA 
construct in HLA A2.1/K^ transgenic mice 



CTL Response ' 



Epitope No. Positive Geo. Mean Response Positive 

^ Cultures/Total ^ Cultures [x/-i-SD] 

~- — — • -^lJ] 

HBVCorelS 9/9 455.5 [2.2] 

HIVEnvl20 12/ 12 211.9 [3.7] 

HBVP01551-V 9/9 126.1 [2.8] 

HBVP01455 12/12 738.6 [1.3] 

HIVPol476 11 /11 ' 716.7 [i. 5] 

HBVEnv 335 12/ 12 43.7 [1.8] 

HE V Cure 18 10/10 ^, 349.3 [1.8] 

(TheradionV - r-. — 



^ Mice were immunized with pMin. 1 DNA or Theradigm-HB V lipopeptide and CTL 
activity in splenoc>^e cultures was determined after in vitro stimulation with 
individual peptide epitopes. Results from four indepisndent experiments are shown. 

^ See Example V, Materials and IMethods for definition of a CTL-positiye culture. 

Response of mice immunized/with Theradigm-HBV lipopeptide containing the HBV 
Core 18 epitope. 



- 78 - 



wo 99/58658 



PCT/US99/10646 



Table 12 
Summary of immunogenicity 
in HLA Al transRenic mice 



CTL Response^ 



Epitope 


No. Positive 
Cultures/Total" 


Geo. Mean Response 
Positive Cultures f x/^ SD] 






ALU 


HBVCore 141 


5/9 


128.1 [1.6] 


HBVPol 149 


6/9 


267.1 [2.2] 


HIV Env 43 


9/9 


40.1 [2,91 



' Mice were immunized with pMin. 1 DNA and CTL activity in splenocyte cultures was 
determined after in vitro stimulation with individual Al 1 -restricted epitopes. The 
5 geometric mean CTL response from three mdependent experiments are shown. 

Definition of a CTL-positive culture is described in Example V, Materials and 
Methods. 
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WHAT IS CLAIMED IS: 



1 1 . An expression vector comprising a promoter operably linked to a 

2 first nucleotide sequence encoding a major histocompatibility (MHC) targeting sequence 

3 fused to a second nucleotide sequence encoding two or more heterologous peptide 

4 epitopes, wherein the heterologous peptide epitopes comprise two HTL peptide epitopes 

5 or a CTL peptide epitope and a universal HTL peptide epitope. 

1 2. The expression vector of claim 1 , wherein the heterologous peptide 

2 epitopes comprise two or more heterologous HTL peptide epitopes. 

1 3 . The expression vector of claim I , wherein the heterologous peptide 

2 epitopes comprise a CTL peptide epitope and a universal HTL peptide epitope. 

1 4. The expression vector of claim 2, wherein the heterologous peptide 

2 epitopes further comprise one or more CTL peptide epitopes. 

1 5. The expression vector of claim 3, wherein the heterologous peptide 

2 epitopes further comprise two or more CTL peptide epitopes. 

1 6. The expression vector ofclaim 3, wherein the heterologous peptide. 

2 epitopes further comprise two or more HTL peptide epitopes. 

1 7. The expression vector of claim 2, wherein one of the HTL peptide 

2 epitopes is a universal HTL epitope. 

1 8. The expression vector of claim 3 or 7, wherein the universal HTL 

2 epitope is a pan DR epitope. 

1 9. The expression vector of claim 8, wherein the pan DR epitope has 

2 the sequence AlaLysPheValAlaAlaTrpThrLeuLysAlaAlaAla (SEQ ID NO:38). 

1 10. The expression vector ofclaim 1, wherein the peptide epitopes are 

2 hepatitis B virus epitopes, hepatitis C virus epitopes, human immunodeficiency virus 

3 epitopes, human papilloma virus epitopes, MAGE epitopes, PSA epitopes, PSM epitopes, 

4 PAP epitopes, p53 epitopes, CEA epitopes, Her2/neu epitopes, or Plasmodium epitopes. 
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1 11. The expression vector of claim 1 0, wherein the peptide epitopes 

2 each have a sequence selected from the group consisting of the peptides depicted in 

3 Tables 1-8. 

1 12. The expression vector of claim 1 1 , wherein at least one of the 

2 peptide epitopes is an analog of a peptide depicted in Tables 1-8. 

1 1 3. The expression vector of claim 1 , vvherein the MHC targeting 

2 sequence comprises a region of a polypeptide selected from the group consisting of the li 

3 protein, LAMP-I, HLS-DM, HLA-DO, H2-D0, influenza matrix protein, hepatitis B 

4 surface antigen, hepatitis B virus core antigen, Ty particle, Ig-a protein, Ig-P protem, and 

5 Ig kappa chain signal sequence. 

1 14. The expression vector of claim 1 , wherein the expression vector 

2 further comprises a second promoter sequence op.erably linked to a third nucleotide 

3 sequence encoding one or more heterologous HTL of CTL peptide epitopes. 

1 " ' 15. The expression vector of claim 1, wherein the vector comprises 

2 pMinlorpEP2. 

1 1 6. The expression vector of claim 3 or 4, wherein the CTL peptide 

2 epitope comprises a structural motif for an HLA supertype, whereby the peptide CTL 

3 epitope binds to two or more members of the supertype with an affinity of greater that 

4 500 nM. 

1 17. The expression vector of claim 4 or 5, wherein the CTL peptide 

2 epitopes have structural motifs that provide binding affinity for more than one HLA allele 

3 supertype. 

1 1 8 . A method of inducing an immune response in vivo comprising 

2 administering to a mammalian subject an expression vector comprising a promoter 

3 operably linked to a first nucleotide sequence encoding a major histocompatibility (MHC) 

4 targeting sequence fused to a second nucleotide sequence encoding two or more 

5 heterologous peptide epitopes, wherein the heterologous peptide epitopes comprise two 

6 HTL peptide epitopes or a CTL peptide epitope and a universal HTL peptide epitope. 
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1 1 9 . The method of claim 1 8, wherein the heterologous peptide epitopes 

2 comprise two or more heterologous HTL peptide epitopes. 

1 20. The method of claim 1 8, wherein the heterologous peptide epitopes 

2 comprise a CTL peptide epitope and a universal HTL peptide epitope. 

1 2 1 . The method of claim 1 9, wherein the heterologous peptide epitopes 

2 further comprise one or more CTL peptide epitopes. 

1 22. The method of claim 20, wherein the heterologous peptide epitopes 

2 fiirther comprise two or more GTL peptide epitopes. 

1 23 . The method of claim 20, wherein the heterologous peptide epitopes 

2 further comprise two or more HTL peptide epitopes. 

1 24. The method of claim 19, wherein the HTL peptide epitope is a 

2 universal HTL epitope. 

1 25 . The method of claim 20 or 24, wherein the universal HTL epitope 

2 is a pan DR epitope. 

1 26. The method of claim 25, wherein the pan DR epitope has the 

I sequence 

AlaLysPheValAlaAlaTipThrLeuLysAlaAlaAla (SEQ ID NO:38). 

1 27. The method of claim 18, wherein the peptide epitopes are hepatitis 

2 B virus epitopes, hepatitis C virus epitopes, human immunodeficiency virus epitopes, 

3 human papilloma virus epitopes, MAGE epitopes, PSA epitopes, PAP epitopes, PSM 

4 epitopes, p53 epitopes, CEA epitopes, Her2/neu epitopes, or Plasmodium epitopes. 

1 28. The method of claim 27, wherein the peptide epitopes each have a 

2 sequence selected from the group consisting of the peptides depicted in Tables 1 -8. 

1 29. The method of claim 28, wherein least one of the peptide epitopes 

2 is an analog of a peptide depicted in Tables 1-8. 

1 30. The method of claim 1 8, wherein the MHC targeting sequence 

2 comprises a region of a polypeptide selected from the group consisting of the li protein, 

3 LAMP-I, HLS-DM, HLA-DO, H2-D0, influenza matrix protein, hepatitis B surface 
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antigen, hepatitis B virus core antigen, Ty particle, Ig-a protein, Ig-P protein, and Ig 
kappa chain signal sequence. 

3 1 . The method of claim 1 8, wherein the expression vector further 
comprises a second promoter sequence operably linked to a third nucleotide sequence 
encoding one or more heterologous HTL or CTL peptide epitopes, 

32. The method of claim 18, wherein the vector comprises pMin. I or 

pEP2. 

33. The method of claim 20 or 21, wherein the CTL peptide epitope 
comprises a structural motif for an HLA supertype, whereby the peptide epitope binds to 
two or more members of the supertype with an affinity of greater that 500 nM. 

34. The method of claim 2 1 or 22, wherein the CTL peptide epitopes 
have structural motifs that provide binding affinity for more than one HLA allele 
supertype. 

35. A method of inducing an immune response in vivo comprising 
administering to a mammalian subject an expression vector comprising a promoter 

3 operably linked to a first nucleotide sequence encoding a major histocompatibiUty (MHC) 

4 targeting sequence fused to a second nucleotide sequence encoding a heterologous human 

5 HTL peptide epitope. 

1 36. The method of claim 35, wherein the second nucleotide sequence 

2 further comprises two or more heterologous HTL peptide epitopes. 

1 3 7 . The method of claim 35, wherein the second nucleotide sequence 

2 fiirther comprises one or more heterologous CTL peptide epitopes. 

1 38. The method of claim 35, wherein the HTL peptide epitope is a 

2 universal HTL peptide epitope 

1 39. The method of claim 38, wherein the universal HTL epitope is a 

2 pan DR epitope. 

1 40. The method of claim 39, wherein the pan DR epitope has the 

2 sequence AlaLysPheValAlaAlaTrpThrLeuLysAlaAiaAla (SEQ ID NO:38). 
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1 41 . The method of claim 37, wherein the HTL and CTL peptide 

2 epitopes are hepatitis B virus epitopes, hepatitis C virus epitopes, human 

3 immunodeficiency virus epitopes, human papilloma virus epitopes, MAGE epitopes, PSA 

4 epitopes, PAP epitopes, PSM epitopes, p53 epitopes, CEA epitopes, Her2/neu epitopes, 

5 or Plasmodium epitopes. 

1 42 , The method of claim 4 1 , wherein the peptide epitopes each have a 

2 sequence selected from the group consisting of the peptides depicted in Tables 1-8. 

1 43. The method of claim 42, wherein at least one of the peptide 

2 epitopes is an analog of a peptide depicted in Tables 1-8. 

1 44. The method of claim 35, wherein the MHC targeting sequence 

2 comprises a region of a polypeptide selected from the group consisting of the li protein, 

3 LAMP-I, HLS-DM, HLA-DO, H2-D0, influenza matrix protein, hepatitis B surface 

4 antigen, hepatitis B virus core antigen, Ty particle, Ig-a protein, Ig-P protein, and Ig 

5 kappa chain signal sequence. 

1 45 . The method of claim 3 5 , wherein the expression vector fiirther 

2 comprises a second promoter sequence operably linked to a third nucleotide sequence 

3 encoding one or more heterologous HTL or CTL peptide epitopes. 

1 46. The method of claim 37, wherein the CTL peptide epitope 

2 comprises a structural motif for an HLA supertype, whereby the peptide epitope binds to 

3 two or more members of the supertype with an affinity of greater that 500 nM, 

1 47. The method of claim 37, wherein the CTL peptide epitopes have 

2 structural motifs that provide binding affmity for more than one HLA allele supertype. 

1 48. A method of assaying the human immunogenicity of a human T 

2 cell peptide epitope in vivo in a non-human mammal, comprising the step of 

3 administering to the non-human mammal an expression vector comprising a promoter 

4 operably linked to a first nucleotide sequence encoding a heterologous human CTL or 

5 HTL peptide epitope. 
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1 49 . The method of claim 48, wherein the first nucleotide sequence 

2 encodes two or more heterologous CTL or HTL peptide epitopes. 

1 50 . The method of claim 48, wherein the non-human mammal is a 

2 transgenic mouse that expresses a human HLA allele. 

1 51 . The method of claim 50, wherein the human HLA allele is selected 

2 from the group consisting of Al 1 and A2. 1, 

1 52. The method of claim 48, wherein the expression vector further 

2 comprise a second nucleotide sequence encoding a major histocompatiblity (MHC) 

3 targeting sequence. 

1 53. The method of claim 48, wherein the HTL peptide epitope is a 

2 universal HTL epitope. 

1 54. The method of claim 53, wherein the universal HTL epitope is a 

2 pan DR epitope. 

1 55 . The method of claim 54, wherein the pan DR epitope has the . 

2 sequence AlaLysPheValAlaAlaTrpThrLeuLysAlaAlaAla (SEQ ID n6:38). 

1 56. The method of claim 48, wherein the CTL or HTL peptide epitopes 

2 are hepatitis B virus epitopes, hepatitis C virus epitopes, human immunodeficiency virus 

3 epitopes, human papilloma virus epitopes, MAGE epitopes, PSA epitopes, PSM epitopes, 

4 PAP epitopes, p53 epitopes, CEA epitopes, Her2/neu epitopes, or Plasmodium epitopes. 

1 57. The method of claim 56, wherein the CTL or HTL peptide epitopes 

2 each have a sequence selected from the group consisting of the peptides depicted in 

3 Tables 1-8. 

1 58. The method of claim 57, wherein at least one of the peptide 

2 epitopes is an analog of a peptide depicted in Tables 1-8. 

1 59. The method of claim 52, wherein the MHC targeting sequence 

2 comprises a region of a polypeptide selected from the group consisting of the li protein, 
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3 LAMP-I, HLS-DM, HLA-DO, H2-D0, influenza, hepatitis B virus core antigen, Ty 

4 particle, Ig-a protein, Ig-P protein, and Ig kappa chain signal sequence, 

1 60. The method of claim 48, wherein the expression vector further 

2 comprises a second promoter sequence operably linked to a third nucleotide sequence 

3 encoding one or more heterologous human CTL or HTL peptide epitopes. 

1 61. The method of claim 48, wherein the vector comprises pMin. 1 or 

2 pEP2. 

1 62. The method of claim 48, wherein the CTL peptide epitope has a 

2 structural motif that provides binding affinity for an HLA allele supertype. 

1 63. The method of claim 49, wherein the CTL peptide epitopes have 

2 structural motifs that provide binding affinity for more than one HLA allele supertype. 

1 64. The method of claim 48, wherein the expression vector comprises 

2 both HTL and CTL peptide epitopes. 



- 86 - 



wo 99/58658 PCT/US99/10646 

10 ' 20 30 40 50 60 70 

GCTAGCGCCGCCACCATGGATGACCAACCSCGACCTCATCTCTAACCATGAGCAATTGCCOITAC^^ 
CGATCGCGGCGGTGGTACCTACTGGTTGCGCTGGAGTAGAGATTGGTACrCGTTAACGGGfATCSACCCGT 
MDDQRDLI S NHEQLPILG> 

80 90 100 110 120 130 140 

^ ♦ * ★ ♦ * ♦ * * * * * 

ACCGCCCTAGAGAGCGAGAAAGGTGCAGCCGTGGAGCTCTGTACACCGGTGTTTCTGTCCT . . 
TGGCGGGATCTCTCGGTCtrrCCACGTCGGCACCTCGAGACATGTGGCCACAAAGACAGGA 
NRPR2PZRCSRGA LYTGV SV. l"vAL> 

ISO leo 170 180 190 200 210 

^* *-♦♦■•♦♦■»*♦ 

GCTCTTGGCTGGGCAGGCCACCACTGCTTACTTCCTGTACCAGCAAC^ 

CGAGAACCGACCCGTCCGGTGGtGACGAATGAAGGACATGGTCGTTGTCCCGGCGGATCTGTTCGACTGG 
L L A G Q A T T A Y F L Y Q Q Q G R L D K L .T> 

"'220 " 230 240 250 2S0 270 280 

atcacctcccagaacctgcaactggagagccrrcgcatgaagc^ 
tagtggagggtcttggacgttgacctctcggaagcgtacitcgaaggctttagac^ 
: ts q n l q l s s lrm klp xs a k p va> 

290 300 310 320 330 340 3S0 

* ■* ■# ♦'♦ ■*.*■ * * * * * * * 

agttcgtggctgcctggaccctgaaggctgccgctatgtccatggataacatgctccttgggcctgtgaa 

tcaagcaccga.cggacctgggacrrrccgacggcgatac:aggtacctattgtacgag<3aa 

kfyaawtlkaaamsmdnmllgpvk> 

360 370. . 380^.: 390 .^...AQQ^.,^ . . .410... ... ..._,..420 

* ♦ •* * * * *^ * * ★ * ♦ ♦ « 
GAACGTTACC:UVGTACGGCAACATGACCCa.GGACCATGTGATGCATC7GCTCACGAGGTeTG^ 
CTTGCAATGGTTCATGCCGTTGTACTGGGTCCTGGTACACTACGTAGACGAGTGCTCCAGACCTGGG^ 

.-..N.y.. T K Y\G -M .7.. .Q D . H V .M .H. • -l^ • . ; X^^^ 

430 440 450 460 470 480 490 

* * ♦ ♦ * ♦ ♦ » *r *.* * * 

GAGTACCa;CAGCTGAJ«5X3GGACCTTCCCAGAGAATCTGAAGCATCT^ 
CTCATGGGCGTCGACTTCGCCTGGAAGGGTCTCTTAGACTTCGTAGAATTCTTGAGG^ 
E Y P Q.L K G 7 F P E N L K H L K N S M D G V> 

500 510 520 S30 540 SSO S60 

i, * * ♦ ♦ ♦ ♦ * ♦ * * * 

ACTGGAAGATCrrCGAGAGC7GGA7GAAGCAG7GGCrCT7GrrTGAGA7GAGCAAGAAC7CCCTGGAGGA 
7GACCTrCTAGAAGCTCTCGACCTAC77CG7CACCGAGAACAAAC7CTAC7CGTTCrT 
NWK IrES WMK QWLLFEM.S KNS LE E> 

570 580 550 600 610 620 630 

* •* * • * * * * " *'•♦ *.* * * 
GAAGAAGCCOVCCGAGGCTCCACCTAAAGAGCCAGTGGACATGGAAGAC^^ 
C7TC7TCGGGTGGCTCa3AGG7GGAlTrCTCGG7GACCTGTACC7TC^ 

K K P T E A P P K E P L D M E D L S S G L G V> 

640 650 660- 

ACCAGGCAGGAACTGGG7CAAGTCACCCTGTGAGGTACC 
TGG7CCG7CCTTGACCCAGT7CAGTC?GGACAC7C;CATGG 
T R Q E L G Q V T L •> 

FIGURE 1 
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40 50 60 70 



10 20 30 

•» « * ♦ 



♦ * * * 



GCTAGCGCCGCCACCATOSATGACCAACGCGACCTCATCTCTAACCATGAGCAATTGCCCATACTGGCSCA 
CGATCGCGGCGGTGGTACCTACTGGTTGCGCTGGAGTAGAGATTGGTACTCGTTAACGGGTATGACCCGT 
MD DQRDLISN KEQLPILG> 

80 90 100 110 120 130 140 

* ************ 

ACCGCCCTAGAGAGCCAGAAAGGTGCAGCCGTGGAGCTCTGTACACCGGTGTTTCTGTCCTGGTGGCTCT 

TGGCGGGATCTCTCGGTCrrrCCACGTCGGCACCTCGAGACATGTGGCCACAAAGACAGGACCACCGAGA 
NRP RSPERCSRGALY TGVSVLVAL> 

150 160 170 IBO 190 200 210 

*"***********'• 

GCTCTTGGCTGGGCAGGCCACCACTGCTTACTTCCTGTACCAGCAACAGGGCCGCCTAGACAAGCTGACC 

CGAGAACCGACCCGTCCGGTGGTGACGAATGAAGGACATGGTCGTrGTCCCGGCGGATCTGTTCGACTGG 
LLAGQATTAYFI-YQQ QGRI.DKLT> 

220 230 240 250 260 270 280 

* 

ATCACCTCCCAGAACCTGCAACTGGAGAGCCTTCGCATGAAGCTTATCAGCCAGGCTGTGCACGCCGCTC 
TAGTGGAGGGTCTTGGACGTrGACCTCTCGGAAGCGTACTTCGAATAGTCGGTCCGACACGTGCGGCGAG 
ITSQNLQLESL. RMKLISQAVKAAs 

290 300 310 320 330 340 350 

ACGCCGAAATCAACGAAGCTGGAAGAACCCCTCCAGCTTATCGCCCrCCaAACGCTGCTATCCTGTTCTT 
TGCGGCTTTAGTTGCTTCGACCTTCTTGGGGAGGTCGAATAGCGGGAGGTTTGCGAGGATAGGACAAGAA 
KAE I NEAG RT P.PAYR P P NAP I LF F> 



360 370 380 390 400 410 420 
**.***** ' 

TCTGCTGACCAGAATCCTGACAATCCCCCAGTCCCTGGACGCCAAGTTCGTGGCTGCCTGGACCCTGAAG 

AGACGACTGGTCTTAGGACTGTTAGGGGGTCAGGGACCTGCGGTTCAAGCACCGACGGACCTGGGACTTC 
LL TRILTIPQSLDAKFVAAWTLK> 

430 
* ♦ * 

GCTGCCGCTTG&GGTACC 
CGACGGCGAACTCCATGG 
A A A *> 



FIGURE 2 
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10 20 30 40 



50 60 70 



GCTAGOSCCGCCACCATGGATGACCAACGCGACCTCATCTCTAACCATGAGCAAT^ 
CGATCGCGGCGGTGCTACCTACTGGTTGCGCTGGAGTAGAGATTGGTACT^^ 

MDDQRDLISNHEQLPILG> 



80 



90 100 XIO 120 130 140 



ACCGCCCTAGAGAGCCAGAAAGGTGCAGCCGTGGAGCTCTGTACACCGGTGTTTCTGTCCTGGTGGCT^ 

TGGCGGGATCTCTCGGTCTTTCCACGTCGGCACCTCGAGACATGTGGCCACAAAGACAG 

NRPR EPERCSRGALYTGVSV1.VAL> 

ISO 160 170 IBO 190 200 210 

GCTCTTGGCTGGGCAGGCCACCACTGCTTACTTCCTGTACCAGCAACAGGG 

CGAGAACCGACCCGXCCGGTGGTGACGAATGAAGGACATGGTCGTTGTCCCGGCGGATCTGTTCGACTGG 
LLAGQATTAYFL Y QQQGRLDKLT> 

220 230 240 250 260 270 280 

ATCACCTCCCAGAACCTGCAACTGGAGAGCCTTCGCATGAAGCTTATCAGCCAGGCTGTG»^ 
'T'AGTGGAGGGTCTrGGACGTTGACCTCTCGGAAGCGTACTTCGAATAGTCGGTCCGACACGTC 
TTSQNLQLESLRMK. LISQAVHAA> 



290 300 310 320 



330 340 350 



ACGCCGAAATCAACGAAGCTGGAAGAACCCCTCCAGCTTATCGCCCTCCAAACGCrCC^^ 

TGCGGCTTTAGTTGCTTCGACCTTCTTGGGGAGGTCGAATAGCGGGAGGTTTGCGAGGATAG^^ 

K A E I N E A G R T P P A y R P P N A P I L F F> 



360 370 380 390 



400 410 420 



TCTGCTGACCAGAATCCTGACAArCCCCCAGTCCCTGGACGCCAAGTTCGTGGCTGCCTGGACCCTG^^ 
AGACGACTGGTGTTAGGACTGTTAGGGGGTCAGGGACCTGCGGTTCAAGCACCGACGGACCTGGGACTTC 

lltrilt:pqsldakfvaawtlk> 

430 440 450 460 470 480 490 

GCTGCCGCTATGTCCATGGATAACATGCTCOTGGGCCTGTGAAGAACGTTACCAAGTACGGC^ 
CGACGGCGATACAGGTACCTATTGTACGAGGAACCCGGACACTTCTTGCAATGGTTCATGCCGCT^ 
AAAM. SMDKMLLGPVKNVTKYGNM> 

500 510 520 530 540 550 560 

* ♦ * ♦ *. * * * * * * * 

CCCAGGACCATGTGATGCATCTGCTCACGAGGTCTGGACCCCTGGAGTACCCGCAGCTGAAGG^ 

GGGTCCTGGTACACTACGTAGACGAGTGCrCCAGACCTGGGGACCTCATGGGCGTCGAC 

TQDHVMHLLTRSGPLEYPQLKGTF> 

570 580 590 600 610 620 630 

CCCAGAGAATCTGAAGCATCTTAAGAACTCCATGGATGGCGTGAACTGGAAGATCTTCGAGAGCTC^ 
GGGTCTCTTAGACTTCGTAGAArrCTTGAGGTACCTACCGCACTTGACCTTCTAGAA^ 

PENI-KHLKNSMDGVNWKIFESWM> 



FXGX7RE 3 
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S40 S30 6S0 670 680 690 700 

AAGCAGTGGCTCTTGTTTGAGATGAGCAAGAACrCCCT(»AGGAGAAGAAGCCCACCGAGGCTCC^^ 
TTCGTOVCCGAGAACAAACTCTACTCGrrCTTGAGGGACCTCCTCTTCTTCGGGTGGCTCCGAGGTGGAT 
XQWLLFSMSKNSLEEKK-PTEAPP> 

710 720 730 • 740 750 760 770 

********** 

AAGAGCCACTGGACATGGAAGACCTATCTTCTGGCCTGGGAGTGACCAGGCAGGAACTGGGTCAAGTCAC 

TTCTCGGTGACCTGTACCrrCTGGATAGAAGACCGGACCCTCACTGGTCCGTCCTTGACCCAGTTCAGTG 
KE PLDMSDLSSGLGVTRQELGQVT> 

780 
* * 

CCTGTGAGGTACC 
GGACACTCCATGG 
L *> 



FIGURE 3 CONTINOED 
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. ^ XO ^20 ^30 ^40 ^ SO ^ ^ 70 

GCTA^CGCOTCCAckTGGiiATG»GGTGCAGaTCC»GAGCCTGTTTCTGCTCCrrCCTGT^ 

StcSSggtggtacccttacgtccacgtctaggtctcggac^aagacgaggaggacacccac^^ 

MGMQVQI QSL FLLLLWVP> 



80 90 100 110 



120 130 140 



♦ ♦♦♦ **** * 



GGTCCAGAGGAATCAGCCAGGCTGTGCACGCCGCTCACGCCGAAATCAACGAAGCTGGflAG^ 
CCAGGTCTCCTTAGTCGGTCCGACACGTGCGGCGAGTGCGGCTTTAGTTGCTTCGACCTTCT^ 
G SRGISQAVHAAHAEINEA GRT. P ?> 

ISO ISO 170 . 130 200 210 

AGCTTATCGCCCTCCAAACGCTCCTATCCTGTTCrrrCTGCTGACCAGAATCCTGACAATCCCCCAGTCC 

-SISGcisGAGGTTTGCGAGGATAGGACAAGAAAGACGACTGGtCT^^ 

AYRPPNAPI1;.? FLLT RIL TIPQS> 

220 230 240 250 260 270 280 

CTGOACGCCAAGTTCOTGGCrGCCrrGGACCCTGAAGGCTGCCGCTAACAACATSTTGATCCCCAr^^ 

gaStgcggttcaagcaccgacggacctgggacttccgacggcgattgttgtacaactaggggtaacgac 

LDAKFVAAWT LKAAANNM LXP IA> 
290 300 310 320 330 340 3S0 



•k 



... .-.^^TGGGCGGTGGGCTGGCAGGGCTKTCC^^^ 

ACCCGCCACGGGACCGTCCCGACCAGGAGTAGCAGGAGTA^.CGGATGGAGTAACCGTCC^^ 

VGGA LAGL VLIVLIAYL I GRKR SH> 



■■" 3W' 370 
* * ♦ * * 

CGCCGGCTATCAGACCATCTAGGGTACC 
GCGGCCGATAGTCTGGTAGATCCCATGG 
A G V Q T I *> 



FIGURE 4 
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10 



20 



30 



40 



50 



60 70 
★ * ♦ 



GCTACKXSCCGCCACCATGGCTGCACTCTGGCTGCTGCTGCTGGTCCTCAGTCTGC^^ 
CGATCGCGGCGGTGGTACCGACGTGAGACCGACGACGACGACOUKSAGTCAGACGTGACATACCCCT^^ 
MAALWLLLLVLSLHCMGI> 



80 



90 



110 



120 



130 



140 



100 

. ♦ ♦ ♦ * * * 

GCCAGGCTGTGCACGCCGCTCACGCCGAAATCAACGAAGCTGGAAGAACCCCTCCAGCTTATCGCCCr 
CGGTCCGACACGTGCGGCGAGTGCGGCTTTAGTTGCTTCGACCTTCTTGGGG^^ 

SQAVKAAHAEINEAGR TPPAYRPP> 



150 
★ 



160 170 
<*- ★ ★ 



180 



190 



200 



210 



AAACGCTCCTATCCTGTTCrrrCTGCTGACCAGAATCCTGACAATCCCCCAGTCCCTGGAC^ 
TTTGGGAGGATAGGACAAGAAAGACGACTGGTCTTAGGACTGTTAGGGGGTCA 

NAPlLFFLr.TRILTlP Q SLDAKF> 



220 



230 240 
* * * * 



2S0 



260 



270 



280 



Q»p»-* (3*rgc C*r GG AC C CTG? 

CACCGACGGACCTGGGACTTCCGACGGCGATTCCAGAGACACAGACGTCGGTGGGACCCGGACCCG^^^ 

KAAAKVSVSAATLGLGF> 



V A A W T 
290 



300 310 320 

***** 



330 



340 
* 



350 



TGATGTTCTGTGTTGGCTTCTTCAGATGGCGCAAGTCTCATTCCTCCAGCTACACTCCTCTCCCT 
AGTAGAAGACACAACCGAAGAAGTCTACCGCGTTCAGAGTAAGGAGGTCGATGTGAGGAGAGGGACCT^^ 
IIF'cVGFrRWRKSHSSSY TPLPGS> 



360 



370 



380 



CACCTACCCAGAAGGACGGCATTAGGGTACC 
GTGGATGGGTCTTCCTGCCGTAATCCCATGG 
TYPEGRH*> 



FIGURE S 
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******** 



50 60 70 

» * ♦ * 

GCrAGCGCCGCCACCATC««CGCTGGGAGGGCCCCCTG<WTGGTGGCTCTGTTGGTGAACCrC^^^ 
cS??SSSSS?LcCGCGACCCTCCCGGGGGACCCACCACCC^GACAACCACT^^ 

MGAGRAPWVVALLVNLMR> 



80 90 100 110 120 130 140 

TGGATTCCATCAGCCAGGCTGTGCACGCCGCTCACGCCGAAATCAACGAAGCTGGAAGAACCCCTCCAGC 

ISSS^ISSgtccgacacgtgcggcgagtxscggctttagttgcttcgaccttcttgg^^^^ 

LOS ISQAV HAAHASINEAGRTP PA> 

ISO 170 ISO 190 200 210 

TTATCGCCCTCCAAACGCTCCTATCCTGTTCTTTCTGCTGACCAGAATCCTGACAATCCCCCAGTCCCTG 

SSgcSIS?Sgcgaggataggacaagaaasacgact^^^ 

YRPPNAPILFFLLTRlLTI9QSI-> 

220 230 240 250 260 270 280 

rACG-CAAGTTCGTGGC^C-CCTGGACCCTGAAGGCTGCCGCTATACTGAGTGGAGCTGCAGTGTTCCTGC 

??^cg^SIgS«Lggacctgggacttccgacggcgatatgac^ 

DAKFVAAWTLKAAAILS GAAVl:i.> 
,90 300 310 320 330 . 340 350 

if -it * * * * 

TTGG^C^GATTGTGTTCCrisTGGGGGTTGTTATCCATCTCJUGGCTCAG 

"^^SSHSSaggaccacccccaacaataggtagagxtccgagtc^^^^^ 

LGLlVFi VGVVIH LKAQxCASVETQ> 

360 . 370 - ^ 380 ^ 390 ^ 400 ^ 410 ^ 420 

GCCTGGCAATGAGAGTAGG?CCCG^ATGAiGGAGCGGCTL.CC:AAGTTCAA^^ 
CGGACCGTTACTCTCATCCAGGGCCTACTACCrCGCCGATTGGTTCAAGTTCCGAC^^ 

PGNESRSRMMERLTKF KAQFV»B 



430 

* * ." 
ACATGAGGTACC 
TGTACTCCATGG 
T *> 



FXGGRE 6 
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30 ^ 40 ^ 50 ^ SO ^ 70 

<^^CGCC;caVciT«GCCA.G;rCGT;«:TGCCTG^^^^^ 
CGATCGCGGCGGTGGTACCa5KCAAGCACC<a^C(WACCTGa5ACTrCCGACGG 

MAXFVAAWT LKAAAMS li U> 

90 ^ 100 ^ XIO ^ 120 ^ 130 ^ 140 

CCGAi.TCGL.CG;ACGT^Crcr=TATcixCCclTCAG^CCCCCTO^ 
GGCrCCAGCTTTGCATGCAAGASAGATAGTAaMTAGTCCGGGCWAGTT^ 

TEVET YVLS IIPSGPLKAr.IAQRt'> 
. ISO ^ ISO ^ 170 ^180 ^ 190 ^ 200 ^ 210 

S?SSIS2SSS?S^TCTAG^ 

E DV FAGKUT DLEA LMEWLKTRP -> 

22o" "'^ 230 240 ^ ''zSO 260 270 280 



lspltkg:lgfvftltvps ergi-> 

. 300 ^ 310 ^ 320 ^ 330 ^ 340 ^ 3S0 

AGCG^AGAC^rrTGTCCALAraCCCrAlATGOGAAT^GACCCA^^ 
TCGCATCTGCTAAACAGCrrTrACGGGATTTACCCTTACCTCTGGGTWOTTCT 

QRR RFVQ KAL NGN GDPNNM DRAVX> 

-160 370 380 390 ^ 400 "l?- .. ."0, 

ACTA;ACAAiu«SC;GAAGlGG=^TGAkTTCa.TGGlaCAA^ 

TClATATGTTCTTCOACTrCTCCCrrTACTGTAA<WTACCTCGTTTCCTTCAACGTGAGTC^^ 
. . t V •!< K. L- K R EMTFHGAKE V A L.S T S T> 

430 44.0 4S0 4S0 470 480 *90 . , 

. • * • V * * * * * ■ 

GGTGCGCTTGCCAGTTGCATGGGTCTCATATACAACCGOATGeGAAGAGTGACCACAGAAGTGGCTC^^ 

SSSgSSIcgtacccagaotatatgttggcctacccttgt^^^ 

GALA-SCMGL IVNRMGT VTTEVAl-> 

. 500 SIO ^ 520 ^ S30 ^ 540 ^ SSO ^ 560 

GCCrlGTA^TGCcIcrrGTaASCAGATTGCreATGCCCAACAICGGTaiaWZAGGaVGAT^ 

cSTST^S^^CACTCGTCTAACGACTACC^GTTGTAGCCAGGGTGTa^^^ 

GLV CATCEQIADAQHRSHRQMATT> 



570 580 590 600 



SIO 620 S30 



CACcLvCCCACTAATCAtWCArGAGAACAGAATGGTACTAGCCAGCACTACGGCTAAGGCCATG^^ 

SSJJSSSSLtccgtactcttgtcttaccatgatcggtcgtgatgccga^^^ 

T. NPL IRE BNRMVLAST TAKAMEQ> 

640 6S0 «° ^ "I , . "° . 

ATGG^TGGATCAAGTGAGclK=SGCAGA^CmTGGA«^CAAGTCAGGCTfl^^ 
JI^CCTAGTICACTCGTCCGTCGTCTCCGCTACCITCAGCGTTCAGTCCGAT^ 
MAGS S EQAABAME VA, a QARQMV Q* 

■ FIGDRE 7 
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780 790 800 810 



GGCTTACCAGAAACGGATGGGGCTGCAGATGCAGCGATTCAAGTGA 
CCGAATGGTCTTTGCCTACCCCCACGTCTACGTCGCTAAGTTCACT 
AyQKRMGVQMQRFK*> 



FIGURE 7 CONTINUED 
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SI??SSSS^accggttca;u3caccgacggacctgggacttcc^^^^ 



80 



90 100 110 120 130 140 



GACCCrGCCTGAACGCCGAGAAOVTCACATCAGGATTCCTAGGACCCCTTCTCGTGTTAC^^ 
SSSS^GCGGCTCTTGTAGTGTAGTCCrAAGGATCCTGGO^C^GCA^^^^ 
GPCLNAENITS GFLGPLLVLQ AGF> 

150 160 170 . 180 ^ 130 200 ^ 210 



220 230 240 2S0 



n^TGTTGACAAGAATCCTCACAATACCGCAGAGTCTAGACTCGTGGTGGACTTCTCTCAATTTT 

SSSSS^SJIggagxgttatg^^^^^^ 

? -L L T - R I - L T I P Q S L P S w w i o 

260 270 280 



ok^aactIccgtgtgtc^tggc^tocIgtcc^caacctc^^^ 
^J^ccttgatggcagacagaaccggttttaag^^ 

GGTTVCl.G Q NSQ SPTSNkSP -SC. 

290 ^ 300 ^ 310 ^ 320 ^ 330 ^ 340 ^ 350 

GCG-^^TTTATCArCTTCCTCTTCATCCTGCTGCt 
GAGGTTGAACAGGACCAATAGCGACCTACACAGACGCCGCAA,^^ 



CTCCAACTTGTCCTGGTTATCGCTGGATGTGTCTGCGGCG- 
WAGCGACCTACACAGACGCCGC 
p p -f- c'-p- G -Y R W M'C L R R F 



370 ' 380 390 



* ■ 



ATGCCTCATCTrCTTGTTGGTTCTTCTGGACTATCAAGGTATGTTGGCCQTXt^xw^^ 

SSSISagaacaaccaagaagacctgatagttccatacaacgggca^^ 

C LIFLLVLLDYQGMLPVC 

. 430 440 450 4.0 470 ^ 480 ^ .30 

XCCX;AACA;cCAGkcGG;ACCA;GCCG;ACCX~^ 



•ACTGATGACGAGTTCCTTGGAGATACATAGGGA 
SSTTSTGPCRTCMTTA 



AGGAGTTGTTGGTCGTGCCCTGGTACGGCCTGGACGT..--.- » q G T S M V P> 



"0 ^ 540 . ^ 550 ^ S.O 
CCXG;rCCr;TACclAACC;xCGG;CGOA;AXTG;ACc4AT^^^ 

ggacaacgac:atggtttggaagcctgcctttaacgtggacataagggtagggtagtag<^ 

SCCCTKPSDGNCTCIPI^^^ 

570 . 580 590 ^ 600 ^ 610 ^ 620 ^ 630 

;^;TCCT;TGGGL.TGG;cCTclGCCC;Trrc;CCrG^^^^ 

ttttaaggataccctcacccggagtcgggcaaagaggaccgmtcaaatgatca^ 

K F LWEWAS ARFSHLSLL VPfvy 

FIGURE 8 
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640 650 660 670 68° 69° 

************ 

TrCGTACKGCTTTCCCCCACTGTTT^CTTTCAGTTATATGGATGATGrrGGTATTC^KSGGCC^ 
AAGCATCCCGAAAGGGGGTGACAAACCGAAAGTCAATATACCTACTACACCATAACCCCCGGTTCAGACSV 
FVGLSPTVWLSVIWMMWYMGPSL> 

710 720 730 740 750 760 770 

ACAGOlTCTTGAGTCCCTrTrrACCGC:TGTTACCAATTTTCrTTTGTCTTTG<K;TA^ 

TGTCGTAGAACTCAGGGAAAAATGGCGAOUVTGGTnrAAAAGAAAACAGAAACCCATATGTAAATTTGGGA 

YSII.SPFLPLLPIFFCLMVYI*> 

780 790 800 

#»*♦*♦ 

AACAAAACAAAGAGATGGGGTTACTCTCTAA 
TTGTTrTGTTTCTCTACCCCAATGAGAGATT 



FIGURE 8 CONTINUED 
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10 20 



30 40 50 60 70 



CGXTCGCCGCCXrKXrrACGCrrCCCCCACATCr^ p c L C F L s> 



ao 



150 160 170 180 ^ 150 ^ 200 ^ 210 




2S0 _ 2S0 ^ 270 ^ 280 




L T I T'^T'u' D A KFVAAWTLKXAAGI> 
290 300 ^ 310 ^ 320 ^ 330 ^ 340 ^ 350 

ISSc^CAAGACACC7Ix:ACCAa=(7rCCCTCCC»^ 

I.LLFCAVVPCTLLLFRKRwyt* 

3S0 370 ' 380 ^ 390 ^ *00 ^ 410 ^ 420- 

ACCCCACCiCTACGcricrAcncxTACTrrrAcrrTrAGAGATAC^ 

GVDKP D D Y E D EKL Y EG LN t.u 



430 440 4S0 4G0 



470 480 490 



^CAXACTCC^AGAGG^CCCr^J^^ , , o> 



MY EDISRGLQ 

500 510 

CCCAGCTOTAAAAGCCATCACGTACC 
GGCTKXIACCTTITCGGTACTCCATGG 
A Q L E K P *> 



FIGURE 9 
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10 20 30 40 : ^ SO . ^ 60 ; 70 

GCTAGCGCci:CAckTO=^CACA^CTGT^CA-rCCCCTC 

^S^CGCGGCWtGGTACCC^ACCACGACAGAAaTrA^^ 

MATLVLSSMPCHWi'*'*'-*^ 

80 90 100 110 ^ 120 ^ 130 ^ 140 

ACGAGAAGAGTCCACrrCGGCTAGTCCGTCCGACACGTGCGGCGACnGC^^ ^ 
L LF SGEPrSQAVHAAKAEINfcA 

ISO 160 170. 180 ^ ISO ^ 200 ^ 210 

T P P. A V B.P P N A P I L F ^ ^ ^ ^ 

220 230 240 ^ 250 ^ 260 ^ 270 ^ 280 

ccxcI^c^rcGA^CALrrici^^ 

.GGGGTCAGGGACCTGCGGTrCAAGCACCGACGGACCTGGGACTTCCGACG^C^^ 
P QS LDA XFVAAWTLKA AA 

• 290 300 110 ^ 320 \ 330 ^ 340 ^.350 

cccr^crcA^rATc^TcrrkTCATTCTO^ccAT^c^^c^ 

GGGAGGAGTAGTAGGAGAAGTAGTAACACGGGTAGAAGGACGATGJACTGTTCCTAC^ 
Tr- L L • r X L F'- I I V P I.'. F L L L D " 



. 360. 370 ■ \ 380 390 _ 400 ^ 410 ^ 420 

CATG^AGGAlGA^CACC^ATGAix^CI^GAAclTT^^ 

. CTACCTCCnriCTAGTGTGaATACrcCCGAACttGTAAeT^^ ^ 
M E E D H T Y E G L M I D Q T A T Y E D X 

430 440 450 460 470 480 


CTTCGGACAGGGGAGGTAAAGTGGTCGGTAGGAGAGCAT^^ 
GAAGCCTGTCCCCTCCATTrca.CCAGCCATCCTCTCGTAGGTCCGGTCCCT 
LRTGEVKWSV GEHPGQE 



FIGURE 10, 
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10 



20 



20 



40 



50 



€0 



70 



CXnrAGCGCCGCCACCATGGGAATGCACOTXSCAGATCCAGAGCL 
CGATCXXTGCCGGTCXTTACCCTrACGTCCACGTCTAGGT^^ 

MGMQVQ IQ S L F L LLLWV.P> 



80 



90 



100 



110 



120 



130 



140 



<»rCCCGAGGAATCAGCCACGCTGTGCACGCCCCT^ 
CCAGGGoI\-u-^-rACTCGGTCCCACACGltXXXX:GAgTCCG^^ 

GSRG I SQA V HAAHAE I NEAG RTPP> 



150 



150 



170 



180 



190 



200 



210 



AGCITAtCGCCCTCCAAA^^ 

TCGAATAGCGGGAGGTrTCCGAGGATAGGACAAjGAAAGACCAC^ 

AYR P P NAP ILFFL L TR I LT I PQS> 



220 



230 



240 



250 



260 



CTGGACGCCAAGTTCGTG 



GACCTGCGGTICAACKIAGCGACTCACCT^ 
L P A K ? V A A W T L K A A A *> 



FIGURE 11 
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TTCCCAG ATG CAC AGG AGG AGA AGC AGO AGC TGT CGG GAA GAT CAG AAG 49 
M^t Kis Arg Arg Arg Ser Arg Ser cys A.g Glu Asp Gin Lys 



1 



S 10 



CCA GTC ATG GAT GAC CAG CGC GAC CTT ATC TCC AAC AAT GAG CAA CTG 
III Mac ASP ASP Gin Arg Asp Leu He Ser Asn Asn Glu Gin Leu 
IS 2° " 

CCC ATG CTG GGC CGG CGC CCT GGG GCC CCG GAG AGC AAG TGC AGC CGC 
Pro mS Su Gly Arg Arg Pro Gly Ala Pro Glu Ser Lys Cys Ser Arg 
35 *0 *^ 

GGA GCC CTG TAG ACA GGC TTT TCC ATC CTG GTG. ACT CTG CTC CTC GCT 
Ty Ma Leu Tyr Thr Gly Phe Ser He Leu Val Thr Leu Leu Leu Ala 
SO ss «° 

GGC CAG GCC ACC ACC GCC TAC TTC CTG TAC CAG CAG CAG GGC CGG CTG 
Z Sn Sa S Thr Ala Tyr Phe Leu Tyr Gin Gin Gin Gly Arg Leu 
65 70 75 

GAC AAA CTG ACA GTC ACC TCC CAG AAC CTG CAG CTG GAG AAC CTG CGC 
^^p Leu Thr val Thr Ser Gin Asn Leu Gin Leu Glu Asn Leu Arg 
80 35 90 

ATG AAG CTT CCC AAG CCT CCC AAG CCT GTG AGC AAG ATG CGC ATG GCC 
mIc S pro Lys Pro Pro Lys Pro Val Ser Lys Met Arg Met Ala 
95 100 - , ,105 1" 

ACC CCG CTG CTG ATG CAG GCG CTG CCC ATG GGA GCC CTG CCC CAG GGG 
Pro Leu Leu Met Gin Ala Leu Pro Met Gly Ala Leu Pro Gin Gly 

.. 125 

CCC ATG CAG AAT GCC ACC AAG TAT GGC AAC ATG ACA GAG GAC CAT GTG 
pro M^t Sn Ala Thr Lys Tyr Gly Asn Met Thr Glu Asp H.s Val 
.130 135 

ATG CAC CTG CTC CAG AAT GCT GAC CCC CTG AAG GTG TAC CCG CCA CTG 
Jet Ss Leu Gin Asn Ala Asp Pro Leu Lys Val Tyr Pro Pro Leu 
145 150 155 

AAG GGG AGC TTC CCG GAG AAC CTG AGA CAC CTT AAG AAC ACC ATG GAG 
Lys Gly ser Phe Pro Glu Asn Leu Arg His Leu Lys Asn Thr Met Glu 
160 165 170 

ACC ATA GAC TGG AAG GTC TTT GAG AGC TGG ATG CAC CAT TGG CTC CTG 
S S Aso Trp Lys val Phe Glu Ser Trp Met His Kis Trp Leu Leu 
175 " 180 185 ISO 



97 



145 



193 



241 



289 



337 



385 



433 



481 



529 



577 



FIGURE 12 
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rrr gaa mg agc agg cxc tcc ttg gag caa aag ccc act gac gct cca 

S Tu Met: Ser Arg His Ser Leu Glu Gl. Lys Pro Thr Asp Ala Pro 
19S 200 205 

CCX; AAA GAG TCA CTG GAA CTG GAG GAC CCG TCT TCT GGG CTG GGT GTG 
Ifo Su ser Leu Glu Leu Glu Asp Pro Ser Ser Gly Leu Gly Val 

ACC AAG CAG GAT CTG C-GC CCA GTC CCC ATG TGAGAGCAGC AGAGGCGGTC 
Thr Lys GXn Asp Leu Gly Pro Val Pro Met 
225 230 



625 



673 



723 



FIGURE 12 continued 
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CCGCCrCGGC ATG GCG CCC CGC AGC GCC CGG CGA CCC CTG CTG CTG CTA 229 
Met Ala pro Arg Ser Ala Arg Arg Pro Leu Leu Leu Leu 
1 5 10 

CTG CCT GTT GCT GCT GCT CGG CCT CAT GCA TTG rCG TCA GCA GCC ATG 277 
Leu Pro Val Ala Ala Ala Arg Pro His Ala Leu Ser Ser Ala Ala Met 
IS 20 2S 

TTT ATG GTG AAA AAT GGC AAC GGG ACC GCG TGC ATA ATG GCC AAC TTC 32 S 

Phe Met Val Lys Asa Gly Asa Gly Thr Ala Cys He Met Ala Asn She 
30 35 40 « 

TCr GCT GCC TTC TCA GTG AAC TAC GAC ACC AAG ACT GGC CCC AAG AAC 
Ser Ala Ala Phe Ser Val Asn Tyr Asp Thr Lys Ser Gly Pro Lys Asr. 

50 55 60 



ATG ACC TTT GAC CTG CCA TCA GAT GCC ACA GTG GTG CTC AAC CGC AGC 
Met Th- Phe Aso Leu Pro Ser Asp Ala Thr Val Val Leu Asn Arg Ser 
65 70 75 



ACT GAC ATC AGG GCA GAT ATA GAT AAA AAA TAG AGA TGT GTT AGT GGC 
Thr ASP He Arg Ala Asp He Asp Lys Lys Tyr Arg Cys Val Ser Gly 
145 ISO 3.SS 

ACC CAG GTC CAC ATG AAC AAC GTG ACC GTA ACG CTC CAT GAT GCC ACC 
Thr Gin Val His Met Asn Asn Val Thr Val Thr Leu His Asp Ala Thr 
160 les 170 

ATC CAG GCG TAC CTT TCC AAC AGC AGC TTC AGC AGG GGA GAG ACA CGC 
He Gla Ala Tyr Leu Ser Asn Ser Ser Phe Ser Arg Gly Glu Thr Arg 
175 180 185 



373 



421 



TCC TGT GGA AAA GAG AAC ACT TCT GAC CCC AGT CTC GTG ATT GCT TTT 469 
Se' Cys Gly Lys Glu Asn Thr Ser Asp Pro Ser Leu Val He Ala Phe 
80 8S 90 

GGA AGA GGA CAT ACA CTC ACT CTC AAT TTC ACG AGA AAT GCA ACA CGT . 517 
Gly Arg' Gly His Thr Leu Thr Leu Asn Phe Thr Arg Asn Ala Thr Arg 
95 100 105 

TAC AGC GTT CAG CTC ATG AGT TTT GTT TAT AAC TTG TCA GAC ACA CAC 565 
Tyr ser Val Gin Leu Met Ser Phe Val Tyr Asn Leu Ser Asp Thr Hxs 
110 lis 120 125 

CTT TTC CCC AAT GCG AGC TCC AAA GAA ATC AAG ACT GTG GAA TCT ATA S13 
Leu Phe Pro Asn Ala Ser Ser Lys Glu He Lys Thr Val Glu Ser He 
130 135 140 



661 



709 



757 



FIGORE 13 
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TGT GAA CAA GAC AGG CCT TCC CCA ACC ACA GCG CCC CCT GCG CCA CCC 805 
Cy3 Glu Gin ASP Arg Pro Ser Pro Thr Thr Ala ?ro Pro Ala Pro Pro 
190 195 200 205 

AGC CCC TCG CCC TCA CCC GTG CCC AAG AGC .CCC; TCT GTG GAC AAG TAC 833 ^ 

Se- o-o ser Pro Ser Pro Val Pro Lys Ser .Pro: Ser Val Asp Lys Tyr 
210 215 220 

AAC GTG AGO GGC ACC AAC GOG ACC TGC CTG CTG GCC AGC ATG GGG CTG 901 
Asa val ser Gly THr Asn Gly Thr Cys Leu Leu Ala S«r M«t Gly Leu 
225 230 y^^^ ■ 

CAG CTS AAC CTC ACC TAT GAG AGG AAG GAC AAC ACG ACG GTG ACA AGG . 949 
Gin lIu Asn Lau Thr Tyr Glu Arg Lys Asp . Asn Thr Thr Val- Thr Arg 
240 24S . ■ ■ 250. ■; ^ 

- CTT CTC AAC ATC AAC^ eCG AAC-AAG...ACC. TCG^GCC- AGC. 505,; AGC TGC .GGC.:..; 
Leu Leu Asn lie Asn Pro Asn Lys Thr Ser Ala Ser Gly Ser Cys . Gly 



2SS 



260 2SS 



270 



290. 



997 



GC CAC CTG GTG ACT CTG GAG CTG CAC AGC GAG GGG ACC ACC GTC CTG 1045 
Ala Kis Leu Val Thr Leu Glu Leu His Ser Glu Gly Thr Thr VaX Leu 
275 280 



CTC rr-C CAG TIC GGG ATG AAT OCA AGT TCT AGC CGG TTT TTC CTA. CAA 1093 
Leu Phe Gin Phe Gly Mec Asn Ala Ser Ser Ser Arg Phe Phe Leu Gin 

295 300 . 



= GGA-ATG-eAG-TTG..AAT .ACA .ATT CTT. CCT GAG,GCC,AGA GAC CCT GCC 7^ ; _ W4l. 

Gly He Gin .Leu Asn Thr He Leu Pro Asp Ala Arg Asp Pro Ala Phe . 

■ 305 310 315 .. , , • . 

AAA GCT GCC- AAC- GGC TCC ' CTG -CGA GCG CTG, CAG- GCC. ACA GTC GGC MfV,,, . 
Lys Ala Ala Asn Gly Ser Leu Arg Ala Leu Gin Ala Thr Val Gly Asn .. 

■ 325 , . • 330 

TCC TAC AAG TGC AAC GCG GAG GAG CAC GTC CGT GTC ACG AAG GCG TTT 

ser Tyr Lys' Cys Asn Ala Glu Glu. Kis Val Arg val Thr Lys Ala Phe . 
. 33S ' 340 345 

TCA GTC AAT ATA TTC AAA GTG TGG GTC GAG GCT TTC AAG GTG GAA GGT 
ser val Asn He Phe Lys Val Trp Val Gin Ala Phe Lys Val Glu G^y 
350 355 ■ SfiO 36= 



1189 



1237 



1285 



GGC CAG TTT GGC TCT GTG GAG GAG TGT CTG CTG GAC GAG AAC AGC ACG 1333 

Glv Gin Phe Gly Ser Val Glu Glu Cys Leu Leu Asp Glu Asn Ser Thr 
370 ^ 375 380 
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CTG ATC CCC ATC GCT GTG GOT GGT GCC CTG GCG GGG CTG GTC CTC ATC 1381 
Leu lie Pro He Ala Val Gly Gly Ala Leu Ala Gly Leu Val Leu He 
385 390 

GTC CTC ATC GCC TAC CTC GTC GGC AGG AAG AGG AGT »C GCA GGC TAG . 1429 
Val Leu He Ala Tyr Leu Val Gly Arg Lys Arg Ser His Ala Gly Tyr 
400 405 410 



CAG ACT ATC TAGCCTGGTG CACGCAGGCA CAGCAGCTGC AGGGGCCTCT 
Gin Thr He 
415 



1478 



FIGURE 13 CONTINaED 
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so ^ so ^ XOO \ 110 ^ 120 ^ 130 ^ 140 
.^^aAAGCACCrGTC-GTrCwIxGATCCTC^ACTCCAAAa^TTTCAC^^^^ 



ISO 



160 170 180 



190 200 210 



C«0C^TCI=CT»CCr=C:3G«TCC«.«AG»ATAAJATC^ 



KDL LTCWDPEE 
220 



233 240 250 260 270 280 



SL AMVLSQHLNQKDTLMQRLRNO 



L Q Sf C A 
360 



370 3a0 390 400 410 420 



CCAA;T;«.C;AAAACCACrCCrTrTAACACGAG<^AGCCTC^^^^^ 
CGTTCATCGGTTTTGGTGAGGAAAATTGTGCTCCCTCGGACACTACGACCGGAC^^^ 

QVAKTTPFNTREPVMLfl-^* 

440 ^ 450 ^ 460 ^ 470 ^ 480 ^ 490 

TATC^GCAGAAGTi^CTATCACGTGGAGjAAGAlcGGGLGOTGTCATGCCTCA^^^ 
SSScTTCACTGArAGTGC:ACCTCCTTCTTGCCCTTCC^CAGT^<30^^^^ 



520 ^ 530 ^ 540 ^ 550 ^ 560 

AGACTGCCclGCCC:L.TGGlGACrkcAmCCAGACCcicTCCCATTTAG 
S^CGGGTTACCTCl^CCTGTATGGTCTGGGAGAGGGTAAAT 
KTAQPNGDWTYQTIiSKLALX f 
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570 580 590 600 



610 620 630 

****** 



CCTGTGAATGTGGACACACCATCTCGTGTAACCCCGAGGACTCGGGTAGGAAGCCCT^^ 

640 ^ «0 ^ 660 ^ 670 ^ 680 ^ 690 ^ 700 

^.-Trrrrf^TGCAGACCCTGAAGGTTTCTGTGTCTGCAGTGACTCTGGGCCTGGGCCrCAT^ 
SSSSrcSSSSc??CCAAAaACACAGACGXCAC.GA^^ 

710 720 ^ 730 y 740 ^ 7S0 ^ 7« ^ 770 

CTCT^GGTGTGATcLcTG^CGGAGAGCT^CCACTCrAGTTACACTCCTCTTCCT 

SgSSSSScgaccgcctctcgaccggtoagatcaat^^^^^ 

780 790 
♦ * ♦ * 

AGAAGGATGGCACATTTCCTAG 
TCTTCCTACCGTGTAAAGGATC 
H G W H I S ♦> 



FIGURE 14 Continued 
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,0 ^ .0 ^ 30 ^ 40 ^ so ^ .0 ^ 

AT«3^TTCTGGGTGGGTCCCCTGGGTGGTGGCTCTGCTAGTGAATCTGACCCAACTG<»TTC^^ 

SSSISc?S?SLGGGACCCACCACCGAGACGAtaVCTTAG.C^^^ 
MGSGWVPWVVAt- LVNLTQLDSSM* 

80 ^90 ^ 100 ^ 110 \ 120 ^130 ^^1*0 

CTCAlcGCAi^GACTCTCclGAAGlTTTnrGTGATTCAGGCAAAGGCTGACTGT^ 
SS^SSScTGAGAGGTCTTCrAAAACACTAAGTCCGTTTCCGACTGAC^^ 
TQGTDSPED?VlQAKADCY FTN5i 

ISO 170 180 190 200 ^ 2X0 



ISO 
* 



, , » arv-Trrjv<3TT->G'rGG''-CAGATTCATCTrTAACTTGGAGaAGTATGTACt. L i . ... 
EK VQFVVR FIFNLEEVVR.FDSCV> 



230 ^ .240 ^ 2S0 ^ 260 ^ 270 ^ 230 

<k3gatg-ttgtggcIttgaccaagctggg^cagc«gatgctgagc^^^ 

?SS;n^CCGTAACTGGTTCGACCCCGTCGGTCTACGACTCGT^^^^ 
C MFVAL. TKLGQ PDAEQ WNSRLDi-* 

320 ^ 330 ^ 340 ^ 350 

.co^^agga^cagacaggccgxggIxggg^xctg™ 



ACCTCTCCrCGTCTGTCCGGCACCTACCCCAGACA' 
R S 



RQAVDGVCR .H NYRLGAPFTV* 



370 ^ 380 ^ 390 ^ 400 ^ 4X0 ^ 420- 
^KK^GAAAAGTGclACCAGAGGxLcAG^GTACCCAGAkGGAC^^^ 

CCCCTCTTTTCACGTTGGTCTCCACTGTCACATGGGTCTCTCCTGGGGTGAGGACGTGGTCGTATTAGAC 
GRX VQPEVTVYP ERT P L LHQHNU> 

480 490 



430 



440 450 460 470 



CTGCACTGCTCTGTGACAGGCTTCTATCCAGGGGAXATCAAGATCAAGTGGT^^^^ 
GACGXGACGAGACACXGXCCGAAGAXAGGXCCCCXAXAGXXCXAGTTCACa.AGGACXTACCCGTCCXCC 

LHCS VTGFYP .GDIKlKWFl-w 

SOO SIO 520 530 540. S50 ^ .560 
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^rAGAGCTGGGGTCATGTCCACTGGCCCTAT»GGAATCMA<»CTGGACCTrTCAGAC^^ 

E RAGVMSTGPIRNGDMTFQTVVML> 

570 580 590 ^ 600 ^ 610 ^ 620 ^ 630 

agaaItgacJJcctgIacttggacatgtctIcacctgccttgtcgatcactccagcctg^^ 
?S?JISSggacttgaacctgtagagatgtggacggaacagctagtgaggtcggacc»^ 
emtpelghvytclvdhs sllspv> 

€40 6S0 6S0 S70 680 690 700 

"CT-'^GTGGAGAGCTciGTCTL^TATTCTTGGAGAAAGATGCTGAGTGGCATTGCAGC 

;^SSSSScSScAGACTTATAAfiAACCTCTTTCTAC^^^^ 
SVEWRAQSEYSWRK M LSGIAAFI.> 

710 720 730 740 750 760 770 

TT-G^CTAATCTTcirrCT«3TGGGAATCGTCATCCAGCrAAGGGCTC^^ 

SSISgIL^aagaccacccttagcagtaggtcgattgccgagtctj^^ 

L G L I F L L V G I V I Q. L R A Q K G Y V R T Q> 

780 790 800 810 820 

* * • * * * - 

GATGTCTGGTAATGAGGrCTCAAGAGCTGTTCTGCTCCCTCAGTCATGCTAA 

ctacagaccattactccagagttctcgacaagacgagggagtcagtacgatt 

. -M. s G N E V, S. R A V L L P Q S C *> , 
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MP GG PGVLQALPAT I F LI- F ^ LS A* 
80 90 100 110 120 130 140 

ISJSa^gggacccacggtccgggacacctacgtgttccagggtcgtagtaa^^ 

VY L G P G C Q A L W MH KV P A S L M V S L G> 
.SO ^ 1.0 ^ 170 ^ ISO ^130. ^ 200 ^ 210 

GGAAL^CGCCCACTicCAATGCCCGCACAATAGGAGCAACaACGCCAACGTCACCT^ 

??rt?^cSOTGAAGGTrACGGGCGTGTTATCGTCGTTGTTGCGGTTGCAGT^^^ 

ED AH FCCPHNSSNNANVT WWRV L> 



220 



230 240 250 260 270 280 



.*- ♦**** 



♦ * ♦ * 



CATGGCAACTACACGTCGCCCCCTGAGTTCTTGGGCCCGGGCGAGGACCCCAATGGTACGCTGATCA^ 

S^Sg^^Stgtgcaccc^gggactcaagaacccgggcccgctcctggggttacgatg 

HGNYTW?PEFLGPG3DPNGTLII> 

„„_ 290 . ^ 300 310 _ "0 ^ 330 /^^O 

AGAATCTGAlcAAGlGCCATGGGGGCATAkcGTGTGCCGGGTCCAGGAGGGC^^ 
^C^TACACTTGTTCTCGGTACCCCCGTATATGCAOVCGGCCCAGGTCCTCCCG^^ 

. 360 370 380 390 ■ 400 410 420 

4r ♦ * * * 



ZCCCAGGCCCTTCCTGGACATGGGGGAGGGCACC 



. * * * * *. • * 

GTCCTGCGGCACGTACCTCCGCGTGCGCCAGCCGCCCC ^^„„„ 

SSIcSSggatggaggcgcacgcggtcggcggggggtccgggaaggacctgta^ 

S C G T Y L R V R Q P P P R ? f L D M G E G T> 



430 440 4S0 460 470 ^ 480 ^ 490 



♦ * * * . * 



AAGAACCGAATCATCACAGGCGAGGGGATCATCCTCCTGTTCTGCGCGGtGGTGCCTGGGACGCTC^ 

^^S^S^^Ltagtgtcggctcccctagtaggaggacaagacgcgccaccacggacc^^^^ 

KKRiI TAEGIILLF C AV VPGTLIo 
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SOO 510 ^ S20 ^ 530 ^ 540 ^ 5S0 ^ 560 

L> 



ACAAGTCCTTTGCTACCGTCTTGCTCTTCGAGCCCAACCTACGGCCCCTACTTATACTTCTA^ 
LrRKRW ONEKLGLDAGDEYEDEK 



570 580 590 600 ^ 610 ^ 620 ^ 630 

TTATivAGG^CTGAlcCTG^ACGACTGCTCCATGiATGAGGACATCTCCCGGGGCCTCCA^^ 

II?IS?SSacttggacctgctgacgaggtacatactcctgtagagggccc^ 
yeglni,ddcs:myed is rglqgty> 

640 650 660 ^ 670 ^ 680 ^ 690 ^ 700 

caggItgtg^cagcctcaIcata^agatgtccIgctggagaagccgtgacacccctactcc^ 

^ASccaSTCGGAGTTGTATCGTCTAGAGGTCGACCTCTTCGGCACTGTGGGGAT^^^ 

qdvgslnigdvqi-ekp*> 
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GAATTCCGCG GTGACC ATG GCC AGG CTG GCG TTG TCT CCT GTG CCC AGC 
Met Ala Arg Leu Ala Leu Ser Pro Val Pro Ser 
1 5 10 

CAC TGG ATG GTG GCG TTG CTG CTG CTG CTC TCA GCT GAG CCA GTA CCA 
Ss Trp Met val Ala Leu Leu Leu .Leu Leu Ser Ala Glu Pro Val Pro 
IS 20 25 

GCA GCC AGA TCG GAG GAG CGG TAC CGG AAT CCC AAA GGT AGT GCT TGT 
Sa ser Glu Asp Arg Tyr Arg Asn Pro Lys Gly Ser Ala Cys 
30 35 40 

TCG CGG ATC TGG CAG AGC CCA CGT TTC ATA GCC AGG AAA CGG CGC TTC 
ser Arg He Trp Gin Ser Pro Arg Phe He Ala Arg Lys Arg Arg Phe 



45 



SO 55 



ACg' GTG AAA ATG CAC TGC TAC ATG AAC AGC GCC TCC GGC AAT GTG AGC 
val Lys Met Kis Cys Tyr Met Asn Ser Ala Ser Gly Asn val Ser 
65 70 75 



SO 



TGG CrC TGG AAG CAG GAG ATG GAC GAG AAT CCC CAG CAG CTG AAG CTG 
Trp Leu Trp Lys Gla Glu Met Asp Glu Asn Pro Gin Gin Leu Lys Leu 

80 85 



ACG CTG AAG GAT GGT ATC ATC ATG ATC CAG ACQ CTG CTG ATC ATC CTC 
Thr Leu Lys Aso Gly He He Met He Gin Ttur Leu Leu He He Leu 
ISO iss -1^° 



49 



97 



145 



193 



241 



289 



337 



GAA AAG GGC CGC ATG GAA GAG TCC CAG AAC GAA TCT CTC GCC ACC CTC 

^l lly U Met Glu Glu Ser Gin Asn Glu Ser Leu Ala Thr Leu 
95 100 

ACC ATC CAA GGC ATC CGG TTT GAG GAC AAT GGC ATC TAC TTC TGC CAG 38S 
Th' He Gin Gly Hs Arg Phe Glu Asp Asn Gly He Tyr Phe Cys Gin 
. * 110 . . US 120 

CAG AAG TGC AAC A.:^C ACC TCG GAG GTC TAC CAG GGC TGC GGC ACA GAG 
Gin Lvs Cys Asn Asn Thr Ser Glu Val Tyr Gin Gly Cys Gly Thr Glu 
125 "0. 13 5 

CTG CGA GTC ATG GGA TTC AGC ACC TTG GCA CAG CTG AAG CAG AGG AAC 
Leu Arg Val Met Gly Phe Ser Thr Leu Ala Gin Leu Lys Gin Arg Asn 
140 145 ISO 155 



433 



481 



529 
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TTC ATC ATC GTG CCT ATC TTC CTG CTG CTG GAG AAG GAT GAG AGC AAG 
Phe He He Val Pro He Phe Leu Leu Leu Asp Lys Asp Asp Ser Lys 
175 180 

GCT GGC ATG GAG GAA GAT CAC ACC TAG GAG GGC CTG GAC ATT GAC CAG 
Ala Gly Met Glu Glu Asp His Thr Tyr Glu Gly Leu Asp He Asp Gin 
190 200 

ACA GCC ACC TAT GAG GAC ATA GTG ACG CTG CGG ACA GGG GAA CTG AAG 
tS Ala Thr Tyr Glu Asp He Val Thr Leu Arg Thr Gly Glu Val Lys 
205 210 215 

TGG TCT GTA GGT GAG CAC CCA GGC CAG GAG TGAGAGCCAG GTCGCCCCAT 
Trp Ser Val Gly Glu His Pro Gly Gin Glu 
220 225 230 



577 



625 



673 



723 
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IF 



Th-bva-F 



2F 



Sgnal seq. 


T helper epitope string 


Transmembrane 


(^toptasmic tail 



1R 



Th-Pad-R 



2R 
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ISO 



ISO 170 laO 190 200 210 



VI «•*•**** 

«0 230 240 2S0 2.0 ^ 270 ^ 280 

400 410 420 



360 



370 380 390 



aX;TTGCTGa3GGCGGGTAACTCCAGTTATTACTGCATACAA<^ 

. 430 440 4S0 ^ 4.0 ^ 4V0 480 ^ 490 

TSSGCAGTTACCCACCTGATAAATGCCATTnaCOKTCAAC^^ 



aagtacgccccctattgacgtcaatgacggtaaatggcg 



icxscctggcattatgcccagtacatgacctta 



TTCATGCGGGGGATAACTGCAGTTACTGCCATTTACCGGGCGGACCGTAATACGGGTa^^^ 

S80 ^ 530 ^ 600 ^ SIO ^ «0 ^ «0 

tgggIcttt^ctac^tggcLtacItcta^gtactagtcItcgctattaccatggt^^^^ 
J^SSS^SaSgtcatgtagatgcataatcagt^^^^^ 

S90 700 



640 «50 660 670 ^ 680 

^ ' * ' \crcIcGGGkTTTCCAAGTCTCCACCCCATTGACGTC^ 
TCATGTAGTTACCCGCACCTAT 



.CTACATCAATG<^CGTGGAXAGCGGTT^ 
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7X0 720 730 7*0 



750 760 770 



"I«MaGTTTC?mT(WCACCAAAATCAACG<WACTTrCCAAAATGTCGTAAC^ 
S^SSSALcCGIGGTTTTAGTTGCCCTGAAAGGTTTr^^^ 



780 730 800 810 



820 830 840 



C^TGGGCGGTAGGCGTGTACGGTGGGJVGGTCrATATAAGCAGAGCTCTCTGGCrAACTJMSM^ 
GOTACCCGCCATCCGCACATGCCACCCTCCAGATATATTCGTCTCGAGAGACCGATTGATCTC^^ 

850 860 870 880 890 9O0 910 

. ; ■ ■ * 

CTGCTTACTGGCTTATCGAAAITAATACGACTCACTATAGGGAGACCCAAGCTGGCTi^ 
GACGAATGACCGAATAGCTTXAATTATGCTGAGTGATATCCCTCTGGGTTCGACCGATCTCA 

520 930 940 9S0 960 970 980 

..»••****** 
CCTATAGAGTCTATAGGCCCACCCCCTTGGCttCTTATGCATGCTATACTGTTT^ 
CK3ATATCTCAGATATCCGGG?GSGGGAACCGAAGAATACGTACOATATOACAAAAACCGAACCCCA«3^ 

990 1000 1010 1020 1030 1040 1050 

. . , , . * » . * - * * ' • 

ACACCCCCGCTrCCTCATGTTATAGGTGATGGTATAGCTTAGCCTATAGGTGTGGGTTATTGACCATTAT 

TCTGOGGGCGAAGGAGTACAATATCCACTACCATATCGAATCGGATATCCACACCC^^ 

1060 1070 1080 1090 1100 UXO 1120 

TGACCACrCCCCTATTGGTGACGATACTTTCCATTACTAATCCATAACA-rWKrrCTT^^ 

:Stgqtgaggggataaccactsc7atoaaaggtaat^ 

1130 1140 1150 1160 . 1170 1180 1190 

. , * * ♦ * 



1200 1210 1220 1230 1240 ^ 12S0 ^ 1260 

TCrCATTTATOTTTACAAATTCAkTATlcAACACCACCGTCCCCAGTGCCCGCAGT^ 

agastaaataataaatgtttaagtgtatatgttgtggtggcaggggtcacgggcgtcaaaaataatttgt 

1270 1280 1290 1300. .1310 .1320 ^1330 

taacgtgggatctccacgcgaatctcgggtacgtgttccggacatgggctcttctccggtagcggc^ 

AITGCACCCTAQAGGTGCGCTrAOAGCCCATGCACAAGGCCTGTACCCGAGAAGAGGCCATCGCCGCCTC 

1340 1350 1360 1370 1380 1390 1400 

» • . . • * * * 
CnTCTACATCCGAGCCCTGCTCCCRTGCCTCCAGCGACrCATGGTCGCTOS^ 
SLlTGTAGGCTCXSGGACGAGGGTACGGAGGTCGCTGAGTACCAGCGAGCCGTCQAGGAACGAGCACT^ 
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1410 



1440, 1450 1460 ^1470 



AGT< 



'GGAGGCCAGACTTAGGCACAGCACGATGCCi 



CACCACCACCAGTGTGCCGCACAAGGCCGTGGCGGTA 



TCACCTCCGGTCTGAATCCGTGTCGTCCTAi 



CGGOTGGTGGTGGTCACACGGCGTGTTCCGGCACCGCCAT 



1460 



1500 1510 1S20 ^ 1S30 ^ 1540 

XS« 1S7. 1"0 

XOO 1«. "S. _1S»0 _«7. 

X„0 Ul. 1«0 _ 17» _ 1T.0 ^ "SO 

-.■»70 1780 1790 1800 1810 1820 

17S0 1"0 1780 

TCGACCJVAGAAAGGCGGAGTCTrCGGTATCTCGGGTGGCGTAGGC3GTCGTACGeAQ.^>iv 

"20 1930 ^1340 ^19S0 ^1360 

.XCC;=CCC;rTGC;GTC=;GCC.kcCckcCC^ 

TAGGAGGGGGAACGACAGGACGGGGTGGGGTGGGGGGTCTTATCTTACTGTGeAiUA 

"SO , 2000 2010 ^ 2020 ^ 2030 



2060 2070 2080 ^ 2090 ^2100 



2040 20S0 
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^"0 .2140 2150 2160 ^2X70 




2180 ^2150 ^2200 ^2210 ^2220 ^2230 ^2240 

.,250 2260 :^ 2270 /2280 ^ 2290- ^ 2300 ^2310 

SSISS??ScCGTG-GGAGTTCGATCGAACTGrrGTTTTTCTAACAGAAAA^^^ 

2320 2330 ^2340 ^2350- ^2360 ^2370 ^23S0 

cgcg^ccaccctcaIaggcItcaccgcgg^ccag^tgaatatcaaatccicctcgt^^ 

SSS^S^CCGTAGTGGCGCCCGGTCCACTTATAGTTTAGGAGGAGCAAAAACCTT^^^ 

.410 2420 ^ 2430 ^ 2440 ^ 24S0 




2490 ^ 2SO0 ^ 2S10 ^ 2520 

CCG«lTTAA^CCA;CACA;rGCC;GCCG™TLTGGlxC^^^ 
CGCTTAJ^TTAMGTCGtGTCXCCGC^^^ 

2530 2540 2550 2560 2570 ^ 2580 ^ 2550 

2«0 • 2S30 2640 2650 ^ 2660 




2680 ^2690 ^2700 ^2710 ^2720 ^2730 

GGAAlGTCCCGTTGlTTXrisTGCCAAAA^CTCGCAWGACGTCAATGG^^ 
SJ^SSGCAACTAAAACCACGGXTTTGrXTGAGGGTAACTGCAGT^^^ 

„,0 2750 2760 ^ 2770 ^ 2780 ^ 2790 ^ 2800 

CCCG;.AGTivAACCGCTA;CCACGCCC^^^^ 
GMCACKaGTTTGOCGATAGGTGCCGGTAACTACATGACGGT^^ 
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aa.O 2820 2830 2840 ^ 2850 ^ 2860 ^ 2870 

->«/./! loTft 2920 2930 2940 

2880 2390 2900 ^ 2910 ^ 2920 , * . , 

„S0 ^2980 ^2970 ^2980 ^2990 ^3000 ^3010 
e^TlAA'AGTCCACCCAT^GACGTCAATeGAAAGTC 

S^SSS?;?^^3TAAC.GCAGTrAccrrrc^ 

3030 3040 30SO 3060 ^ 3070 ^ 3080 



3090 



3100 31X0 ^ 3120 ^ 3130 ^ 3140 ^ 31S0 

TATG^AACG^aSAACTCCAkTAT^GGCrlTGAA^rAXT^ACCC^^^ 
ATACATreCGCCTTGAGGTATATACCCGATACTTGATTACIGGGGCATTAACTiU^TGATAaTTATTC^^^ 

3130 3190 ^ 3200 ^3210 ^3220 

XCAA;AATcL.TGT;CTGclTrAA;GAAT;GGCclACGC;CGGG^^^ 
LTtATTAGTTACRGGACGTAATTACTTAGCCGGTTGCGOSCCCCTCTCCGCCA^ 

3250 ;^3260 ^3270 ^3280 ^3290 

crrc;GCTT;c:rcG;TCAc;GACT;GC^G;GCT4tc^^^^ 

GAAGGCGAAGGAGCGAGTGACTGAGCGACGCGAGCCAGCAAGCCGACGCCGCTCGCCATi^ 

3310 ^3320 ^3330 ^3340 ^33S0 ^3360 

AAAG^CGGTLTAciTrTAT==AclGAAT^GGkTAA^CAGGAAAG^^^ 
^TrSGCCATTATGCCAATAGGTGTCTTAGTCCCCTATrGCGTCCTTTCTTGTACACTCGOT 

3370 3380 3390 3400 3410 ^ 3420 ^ 3430 
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3510 3520 3S30 3S40 3SS0 3S60 3570 



CCCTGGAAGCTCCCCCGTGCOCTCTCCTaTTCCGACCCTGCCCKrTTACCGGATACCTGTCCGCCTrT^^ 

3580 3590 3600 3S10 3620 3630 3640 

CCTTCCWGJ^GCGTGGCGCrrrCTOUlTGCTC^CGCTQTAGGTATCTaVGTTCtMTGTAGGTC^^ 
SSSrCGaCCGCGAAAGACTTACGAGltKaACATCCATAGACTC^^ 



3650 3660 3670 3680 3690 ^ 3700 



3710 
♦ • • 



cCAAarrSGGCTSTGTGCACGAACCCCCCGTrCAGCCCGACCGCTGCGCOT^ 

SSSSSL:gtgcttggggggcaagtcgggctggcgacgcggaataggccat^^ 

3720 3730 3740 3750 3760 3770 3780 



♦ ♦ 




3790 3800 38X0 3820 3830 3840 3850 

aggtatgtaggcggtgctacagagt^cttgaagtcgtggcctaactacggctacactag?j^ 
tcSSStccgccacgatgtctcaagaacttcaccaccggattoatgccgatgtgatcttcct^ 

38S0 3870 3880 ^ 3890 ^ 3900 ^ 3910 ^ 3920 

TTGGTATCTGCGCTCTGCriutfKrCAGTTlcCTrCGGAAAAAOAGTTGCTAGCTCTTGATC^^ 
SS^MlcSGAGACGACrrrCGGTCAATGGAAGCCTTTCT^ 

3930 3940 3950 3960 ^ 3970 ^ 3980 ^ 3990 

flACcIcCGC^TA^CGGTGGTTrirTTGTTTGCAAGCyUSCAGATTAaiCXSCAGAAAAA^ 
?J^«SScCATCGCCr.CCAAAAAAACAAACGTTCGTCGTCrAATGCGCGTCTTTTTTTCCTAGAGT^ 

4000 4010 4020 4030 ^ 4040 ^ 40S0 ^ 4060 

fiAAGlTCCTTrGATCrrrrCTACG^GGTCTGACGCTCAGTGGAACGAA^ 
SJSISinSISSLsATGCCCCAGACTGCGAGTCACCTTGCTTTTGAGTGCAATTCCCT 

4070 4080 4090 ^ 4100 ^ 4110 ^ 4120 ^ 4130 

tcatgaacaItaaaIctgtctgcttacatIaacagtaatacaaggggtgttatgagccatatt^^ 
ISIcS^ISttgacagacgaatgtatttgtcattatgttccccacaatact 

4140 4150 4160 4170 4180 4190 4200 

AAACGTCrrGCTCGAGGCCGCGATTAAATTCaUCATGGATGCTGATCTATATG^^ 
^SGSSSScTCCGGCGCTAATrrAAGGTTGTACCTACGACTAAATATACCC^^^ 
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4210 4220 4230 4240 42S0 4260 4270 

♦ ♦♦•♦***** 

cc»taatgtcg«3cLtcamtgcgacaatctatcgattgtat«xs^ 

4280 4290 4300 4310 4320 4330 4340 

CTGAAACATGGCAAAGGTAGCGTTGCCAATGATGTTACAGATGAGATCCTCAGACTJUU^ 
GACTTrGTACCGTTrCCATCGCAACGGTTACTACAATGTCTACTCTACCAGTCTCam^ 

4350 4360 4370 4380 4390 4400 4410 



AATTTATCCCTCTTCCGACCATCAAGaVTTTTATCCGTACTCCTGATSAl^TGGTTACraVCCA^ 

ctaLtacggagaaggctggtagttcgtaaaataggcatgaggaccactacgtacca^ 

4420 4430 4440 44S0 4460 ^ 4470 ^ 4480 

(aVTCCCCGG^VAAA^^TCCA^ATijUSAA^AATATCCTGATTCAGGT 

ctmgggcccttttgtcgtaaggtccataatcttcttatagsactaagtccactt^ 

4490 4500 4S10 4S20 4S30 4S40 4550 



rreGCAGTGTTCCroCBCCGGTTGa^TTCGATTCCTGTTTGTAATTGTCCTTTT^^ 
SS^SSilGCOGCCAA<^TAAGCTAAGGACA*ACATTAAC^ 

4560 4S70 4S80 4S90 4600 4610 4620 

. * * 

TTCGTCTCGCr«GGCGaUVTCACGAATGAATAACGGTTTGGTT6AT6CGAGTGATr^ 

ISSS^cSSkgcgttaotgcttacttattgccaaaccaactacgctOvctaaaa 

4630 4640 4650 4660 4670 ^ 4S80 ^ 4690 

TAATGGCTGGCCTGirGAALuWTCTGGAlAGAAATGCATAAACrrrTGC^^^ 
AXrlKGACCGGACAACrrGTrCAOACCTTTCrrTACGTATTTGAAAACGGTAAG^ 

4700 4710 4720 4730 4740 4750 • 4760 

. ♦ • * * » * * • * * 

OTCACTWTGGTGATTTCTCACTTOATAACCTTATTmGACGAGGGGAAATTAATA^^^ 

S^GTAcLrTAAAOAGTGAACTATTOGAATAAAAACIGCTCCCCTTTAATTATC^^ 

4770 4780 4790 4800 4810 ^ 4820 ^ 4830 



★ * 



* * 



TTGGACGAGTCGGAATCGCAGACCGATACCAGGATCTTGCCATCCTATGGAACTGCCrCGGTG^ 
AACOTCTCAGCCTTAGCGTCTGGCTATGGTCCTAGAACGGTAGGATACC^ 

4840 48S0 4860 ^ 4870 ^ 4880 ^ 4890 ^ 4900 

TCCT^a^TrlcAGAlACGG^TTTT^CAAAAATATGGTATTGATAATCCTGATATGAATi^ 
ISlGTAATGTCTTTGCCGAAAAAGTrTTTATACCATAACTAXTAGGACTATACTTATCTAAOT^ 
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4920 4930 4940 4950 4960 4970 



4910 4920 ^if^^ ^ ---- * # ♦ 

GTAAACTACGAGCTAC 



^CTCAJUAAGATTAGTCTrAACCAATTAACCAACATTGTGACCGTCTC^^ 



4980 4990 5000 5010 S020 5030 S040 

GCGGATACATATTTGAATGTArrrAGAAAAATAAACAAATAGGGGTTCCGCG^^ 
CGOTATGTATAAACTTACATAAATCTTTTrAm^ 

5050 
♦ * 

GCCAGCTGACGTC 
CGGTGGACTGCAG 
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10 20 30 40 50 60 70 

♦ ** 

GCTAGCGCGGCCACCATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCC^ 
CGATCGCGGCGGTGGTACCCTTACGTCCACGTCTAGGTCTCGGACAAAGACGAGGAGGACACCCAC^^ 
M GMQVQIQSLFLLLLWVP> 

80 90 100 110 120 130 140 



★ * * * ♦ 



GGTCCMAGGACACACCCTGTGGAAGGCCGGAATCCTGTATAAGGCCAAGTTCGTGGCT^ 
CCMGTCTCCTGTGTGGGACACCTTCCGGCCTTAGGACATATTCCGGTT 

G S RGHTLWKAGI LyKAKFVAAWTL> 
XSO 160 170 180 190 200 210 

^ * * w * * * ♦ * * * ♦ * 

GAAGGCTGCCGCTrrCCTGCCTAGCGATTTCTrrCGTAGCGTGAAGCTGACCCCACTGTGCGTGA 

CrrCCXSACGGCGAAAGGACGGATCGCTAAAGAAAGGATCGCACCT 
*KAAA F L?SDFFPSVKLTPLCVTL> 

220 230 240 250 260 270 280 

************ 



* 



TAtATGGATGACGTGGTGCTGGGAGCCAGGATCATCAAOTTCGAGAA^ 

ATATACCTACTGCACCACGAZCCTCGGTCGTAGTAGttGAAGCTCTTCGACCCTGACAGGTCT^ 
YMDDVVLGASI INFEKLGtSRY V> 

290 30C 310 320 330 340 350 

* * ♦ * * *.* * * * * * * 

CTAGGCTGATCCTGAAGGAGCCTGTGCACGGCGTGTCCACCCTGCCAGAGACCACCGTGGT^^ 

GATCCGACTACK3ACTTCCtCGGAdiCGtGCCT 

A RL ILKB? VHG V STLPET TV VRRT> 

360 . 370 380. .- .3.90 400...^^.^^^^^^ 

* ♦ ' ♦ * * ♦ * * ■ * * * : • ■ 
CGTGTACTATGGAGTGCCTGTGTGGAAGTGGCTGAGCCTGCTGGTGCCCTTTGTGGGTACC 
GCACATGATACCTCACGGACl^CACCTTCACCGACTCGGACGACCACGGGAAACACCC^^ 
VYYGVP VWKWLSL riVPFVGT> 
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10 20 30 40 50 60 70 



* * * 



GCTAGCGCCGCCWrCATGGGAATGCAGGTGCAGATCCAGAGCCTGTTrCTGCTCCTCCTGT^ 

SSgcggtggtacccttacgtccacgtctaggtctcggacaaagacgaggaggacacccac^^ 
mgmqvqiqslfllllwvpj 

80 90 100 110 120 130 140 



♦ ♦ ♦ ♦ 



,♦♦*** 



sgtccmaggacrcaccctgtggaaggccggaatcctgtataaggccaagct^ 

SSS^SSGTGGGACACCTTCCGGCCTTAGGACATArrCCa^^ 



150 160 170 180 



♦ * 



190 200 210 

«♦♦•*• 

OAAGGCTGCCGCrrrCCTGCCTAGCGATTTCTTTCCTAGCGTGAAGCTGACCCCACTGTGCGT^ 
CTTCCGACGGCGAAAGGACS(»TGGCTAAAGA^AGGATCGCACTTCGACT^ 

KAA AFLPS D F FPSVKLT P L CVT L> 

220 230 240 250 260 270 280 
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